CHROMOSOME-BASED PLATFORMS
RELATED APPLICATIONS

Benefit of priority to U.S. provisional application Serial No.
60/294,758, filed May 30, 2001, to Perkins et al., entitled
"CHROMOSOME-BASED PLATFORMS" and to U.S. provisional application
Serial No. 60/366,891, filed March 21, 2002, to Perkins et al., entitled
"CHROMOSOME-BASED PLATFORMS" is claimed. Where permitted, the
subject matter of each is herein incorporated by reference in its
entirety.

This application is related to Provisional Application No.
60/294,687, filed May 30, 2001, by CARL PEREZ AND STEVEN
FABIJANSKI entitled PLANT ARTIFICIAL CHROMOSOMES, USES
THEREOF AND METHODS FOR PREPARING PLANT ARTIFICIAL
CHROMOSOMES and to U.S. Provisional Application No. 60/296,329,
filed June 4, 2001, by CARL PEREZ AND STEVEN FABIJANSKI entitled
PLANT ARTIFICIAL CHROMOSOMES, USES THEREOF AND METHODS
FOR PREPARING PLANT ARTIFICIAL CHROMOSOMES. This application
also is related to U.S. Provisional Application No. 60/294,758, filed May
30, 2001, by EDWARD PERKINS et al. entitled CHROMOSOME-BASED
PLATFORMS and to U.S. Provisional Application No. 60/366,891, filed
March 21, 2002, by EDWARD PERKINS et al. entitled
CHROMOSOME-BASED PLATFORMS. This application is also related to
U.S. application Serial Nos. (attorney dkt nos. 24601-419 and 419PC),
filed on the same day herewith, entitled PLANT ARTIFICIAL
CHROMOSOMES, USES THEREOF AND METHODS OF PREPARING
PLANT ARTIFICIAL CHROMOSOMES to Perez et al.

This application is related to U.S. application Serial No.
08/695,191, filed August 7, 1996 by GYULA HADLACZKY and ALADAR
SZALAY, entitled ARTIFICIAL CHROMOSOMES, USES THEREOF AND
METHODS FOR PREPARING ARTIFICIAL CHROMOSOMES, now U.S.
Patent No. 6,025,155. This application is also related to U.S. application
Serial No. 08/682,080, filed July 15, 1996 by GYULA HADLACZKY and
ALADAR SZALAY, entitled ARTIFICIAL CHROMOSOMES, USES THEREOF
AND METHODS FOR PREPARING ARTIFICIAL CHROMOSOMES, now
U.S. Patent No. 6,077,697. This application is also related to U.S.
application Serial No. 08/629,822, filed April 10, 1996 by GYULA
HADLACZKY and ALADAR SZALAY, entitled ARTIFICIAL
CHROMOSOMES, USES THEREOF AND METHODS FOR PREPARING
ARTIFICIAL CHROMOSOMES (now abandoned), and is also related to
copending U.S. application Serial No. 09/096,648, filed June 12, 1998,
by GYULA HADLACZKY and ALADAR SZALAY, entitled ARTIFICIAL
CHROMOSOMES, USES THEREOF AND METHODS FOR PREPARING
ARTIFICIAL CHROMOSOMES and to U.S. application Serial No.
09/835,682, filed April 10, 1997 by GYULA HADLACZKY and ALADAR
SZALAY, entitled ARTIFICIAL CHROMOSOMES, USES THEREOF AND
METHODS FOR PREPARING ARTIFICIAL CHROMOSOMES (now
abandoned). This application is also related to copending U.S. application
Serial No. 09/724,726, filed November 28, 2000, U.S. application Serial
No. 09/724,872, filed November 28, 2000, U.S. application Serial No.
09/724,693, filed November 28, 2000, U.S. application Serial No.
09/799,462, filed March 5, 2001, U.S. application Serial No.
09/836,911, filed April 17, 2001, and U.S. application Serial No.
10/125,767, filed April 17, 2002, each of which is by GYULA
HADLACZKY and ALADAR SZALAY, and is entitled ARTIFICIAL
CHROMOSOMES, USES THEREOF AND METHODS FOR PREPARING
ARTIFICIAL CHROMOSOMES. This application is also related to
International PCT application No. WO 97/40183. Where permitted, the
subject matter of each of these provisional applications, international
applications, and applications is incorporated by reference in its entirety.
FIELD OF INVENTION 

Artificial chromosomes, including ACes, that have been engineered
to contain available sites for site-specific, recombination-directed
integration of DNA of interest are provided. These artificial chromosomes
permit tractable, efficient, rational engineering of the chromosome.
BACKGROUND 

Artificial chromosomes 

A variety of artificial chromosomes for use in plants and animals,
particularly higher plants and animals, are available. In particular, U.S.
Patent Nos. 6,025,155 and 6,077,697 provide heterochromatic artificial
chromosomes designated therein as satellite artificial chromosomes
(SATACs) and now designated artificial chromosome expression systems
(ACes). These chromosomes are prepared by introducing heterologous
DNA into a selected plant or animal cell under conditions that result in
integration into a region of the chromosome that leads to an amplification
event resulting in production of a dicentric chromosome. Subsequent
treatment and growth of cells with dicentric chromosomes, including
further amplifications, ultimately results in the artificial chromosomes
provided therein. In order to introduce a desired heterologous gene (or a
plurality of heterologous genes) into the artificial chromosome, the
process is repeated, introducing the desired heterologous genes and
nucleic acids in the initial targeting step. This process is time-consuming
and tedious. Hence, more tractable and efficient methods for introducing
heterologous nucleic acid molecules into artificial chromosomes,
particularly ACes, are needed.



Therefore, it is an object herein to provide engineered artificial 
chromosomes that permit tractable, efficient and rational engineering of 
artificial chromosomes. 
SUMMARY OF THE INVENTION 
Provided herein are artificial chromosomes that permit tractable,
efficient and rational engineering thereof. In particular, the artificial
chromosomes provided herein contain one or a plurality of loci (sites) for
site-specific, recombination-directed integration of DNA. Thus, provided
herein are platform artificial chromosome expression systems ("platform
ACes") containing single or multiple site-specific recombination sites.
The artificial chromosomes and ACes artificial chromosomes include plant
and animal chromosomes. Any recombinase system that effects
site-specific recombination is contemplated for use herein.

In one embodiment, chromosomes, including platform ACes, are
provided that contain one or more lambda att sites designed for
recombination-directed integration in the presence of lambda integrase,
and that are mutated so that they do not require additional factors.
Methods for preparing such chromosomes, vectors for use in the
methods, and uses of the resulting chromosomes are also provided.

Platform ACes containing the recombination site(s), methods for
introducing heterologous nucleic acid into such sites, and vectors therefor
are provided.

Also provided herein is a bacteriophage lambda (λ) integrase
site-specific recombination system.

Methods using recombinase-mediated recombination target gene
expression vectors and/or genes for insertion thereof into platform
chromosomes and the resulting chromosomes are provided.

Combinations and kits containing the combinations of vectors 
encoding a recombinase and integrase and primers for introduction of the 
site recognized thereby are also provided. The kits optionally include
instructions for performing site-directed integration or preparation of ACes 
containing such sites. 

Also provided herein are mammalian and plant cells comprising the
artificial chromosomes and ACes described herein. The cells can be
nuclear donor cells or stem cells, such as a mesenchymal stem cell, a
hematopoietic stem cell, an adult stem cell or an embryonic stem cell.

Also provided is a lambda-intR mutein comprising a glutamic acid to
arginine change at position 174 of wild-type lambda-integrase. Also
provided are transgenic animals and methods for producing a transgenic
animal, comprising introducing an ACes into an embryonic cell, such as a
stem cell or embryo. The ACes can comprise heterologous nucleic acid
that encodes a therapeutic product. The transgenic animal can be a fish,
insect, reptile, amphibian, arachnid or mammal. In certain embodiments,
the ACes is introduced by cell fusion, lipid-mediated transfection by a
carrier system, microinjection, microcell fusion, electroporation,
microprojectile bombardment or direct DNA transfer.

The platform ACes, including plant and animal ACes, such as
MACs, provided herein can be introduced into cells, such as, but not
limited to, animal cells, including mammalian cells, and into plant cells.
Hence plant cells that contain platform MACs, animal cells that contain
platform PACs and other combinations of cells and platform ACes are
provided.

DESCRIPTION OF FIGURES 

FIGURE 1 provides a diagram depicting creation of an exemplary
ACes artificial chromosome prepared using methods detailed in U.S.
Patent Nos. 6,025,155 and 6,077,697 and International PCT application
No. WO 97/40183. In this exemplified embodiment, the nucleic acid is
targeted to an acrocentric chromosome in an animal or plant, and the
heterologous nucleic acid includes a sequence-specific recombination site
and marker genes.

FIGURE 2 provides a map of pWEPuro9K, which is a targeting
vector derived from the vector pWE15 (GenBank Accession # X65279;
SEQ ID No. 31). Plasmid pWE15 was modified by replacing the SalI
(Klenow filled)/SmaI neomycin resistance-encoding fragment with the
PvuII/BamHI (Klenow filled) puromycin resistance-encoding fragment
(isolated from plasmid pPUR, Clontech Laboratories, Inc., Palo Alto, CA;
GenBank Accession no. U07648; SEQ ID No. 30) resulting in plasmid
pWEPuro. Subsequently a 9 Kb NotI fragment from the plasmid pFK161
(see Example 1; see, also, Csonka et al. (2000) Journal of Cell Science
113:3207-3216; and SEQ ID NO: 118), containing a portion of the
mouse rDNA region, was cloned into the NotI site of pWEPuro resulting in
plasmid pWEPuro9K.

FIGURE 3 depicts construction of an ACes platform chromosome
with a single recombination site, such as loxP sites or an attP or attB site.
This platform ACes chromosome is an exemplary artificial chromosome
with a single recombination site.

FIGURE 4 provides a map of plasmid pSV40-193attPsensePur.

FIGURE 5 depicts a method for formation of a chromosome
platform with multiple recombination integration sites, such as attP sites.

FIGURE 6 sets forth the sequences of the core region of attP, attB,
attL and attR (SEQ ID Nos. 33-36).

FIGURE 7 depicts insertional recombination of a vector encoding a
marker gene, DsRed and an attB site with an artificial chromosome
containing an attP site.

FIGURE 8 provides a map of plasmid pCXLamIntR (SEQ ID NO:
112), which includes the Lambda integrase (E174R)-encoding nucleic
acid.

FIGURE 9 diagrammatically summarizes the platform technology;
marker 1 permits selection of the artificial chromosomes containing the
integration site; marker 2, which is promoterless in the target gene
expression vector, permits selection of recombinants. Upon
recombination with the platform, marker 2 is expressed under the control
of a promoter resident on the platform.

FIGURE 10 provides the vector map for the plasmid
pISattBZEO-5'6XHS4eGFP (SEQ ID NO: 116).

FIGURE 11 provides the vector map for the plasmid
pISattBZEO-3'6XHS4eGFP (SEQ ID NO: 115).

FIGURE 12 provides the vector map for the plasmid
pISattBZEO-(6XHS4)2eGFP (SEQ ID NO: 110).

FIGURES 13 AND 14 depict the integration of a PCR product by
site-specific recombination as set forth in Example 8.

FIGURE 15 provides the vector map for the plasmid pPACrDNA as
set forth in Example 9.
DETAILED DESCRIPTION OF THE INVENTION 
A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used
herein have the same meaning as is commonly understood by one of skill
in the art to which the invention(s) belong. All patents, patent
applications, published applications and publications, GenBank sequences,
websites and other published materials referred to throughout the entire
disclosure herein, unless noted otherwise, are incorporated by reference
in their entirety. Where reference is made to a URL or other such
identifier or address, it is understood that such identifiers can change and
particular information on the internet can come and go, but equivalent
information can be found by searching the internet. Reference thereto
evidences the availability and public dissemination of such information.



As used herein, nucleic acid refers to single-stranded and/or
double-stranded polynucleotides, such as deoxyribonucleic acid (DNA)
and ribonucleic acid (RNA), as well as analogs or derivatives of either
RNA or DNA. Also included in the term "nucleic acid" are analogs of
nucleic acids such as peptide nucleic acid (PNA), phosphorothioate DNA,
and other such analogs and derivatives. When referring to probes or
primers, optionally labeled with a detectable label, such as a fluorescent
or radiolabel, single-stranded molecules are contemplated. Such
molecules are typically of a length such that they are statistically unique
and of low copy number (typically less than 5, preferably less than 3) for
probing or priming a library. Generally a probe or primer contains at least
14, 16 or 30 contiguous nucleotides of sequence complementary to or
identical to a gene of interest. Probes and primers can be 10, 20, 30, 50,
100 or more nucleotides long.

As used herein, DNA is meant to include all types and sizes of DNA
molecules including cDNA, plasmids and DNA including modified
nucleotides and nucleotide analogs.

As used herein, nucleotides include nucleoside mono-, di-, and
triphosphates. Nucleotides also include modified nucleotides, such as,
but not limited to, phosphorothioate nucleotides and deazapurine
nucleotides and other nucleotide analogs.

As used herein, heterologous or foreign DNA and RNA are used
interchangeably and refer to DNA or RNA that does not occur naturally as
part of the genome in which it is present or which is found in a location
or locations and/or in amounts in a genome or cell that differ from that in
which it occurs in nature. Heterologous nucleic acid is generally not
endogenous to the cell into which it is introduced, but has been obtained
from another cell or prepared synthetically. Generally, although not
necessarily, such nucleic acid encodes RNA and proteins that are not
normally produced by the cell in which it is expressed. Any DNA or RNA
that one of skill in the art would recognize or consider as heterologous or
foreign to the cell in which it is expressed is herein encompassed by
heterologous DNA. Heterologous DNA and RNA may also encode RNA or
proteins that mediate or alter expression of endogenous DNA by affecting
transcription, translation, or other regulatable biochemical processes.

Examples of heterologous DNA include, but are not limited to, DNA
that encodes a gene product or gene product(s) of interest, introduced for
purposes of modification of the endogenous genes or for production of an
encoded protein. For example, a heterologous or foreign gene may be
isolated from a different species than that of the host genome, or
alternatively, may be isolated from the host genome but operably linked
to one or more regulatory regions which differ from those found in the
unaltered, native gene. Other examples of heterologous DNA include, but
are not limited to, DNA that encodes traceable marker proteins, such as a
protein that confers traits including, but not limited to, herbicide, insect,
or disease resistance; traits, including, but not limited to, oil quality or
carbohydrate composition. Antibodies that are encoded by heterologous
DNA may be secreted or expressed on the surface of the cell in which the
heterologous DNA has been introduced.

As used herein, operative linkage or operative association, or
grammatical variations thereof, of heterologous DNA to regulatory and
effector sequences of nucleotides, such as promoters, enhancers,
transcriptional and translational stop sites, and other signal sequences
refers to the relationship between such DNA and such sequences of
nucleotides. For example, operative linkage of heterologous DNA to a
promoter refers to the physical relationship between the DNA and the
promoter such that the transcription of such DNA is initiated from the
promoter by an RNA polymerase that specifically recognizes, binds to and
transcribes the DNA.

In order to optimize expression and/or in vitro transcription, it may
be necessary to remove, add or alter 5' untranslated portions of the
clones to eliminate extra, potential inappropriate alternative translation
initiation (i.e., start) codons or other sequences that may interfere with or
reduce expression, either at the level of transcription or translation.
Alternatively, consensus ribosome binding sites (see, e.g., Kozak (1991)
J. Biol. Chem. 266:19867-19870) can be inserted immediately 5' of the
start codon and may enhance expression.

As used herein, a sequence complementary to at least a portion of
an RNA, with reference to antisense oligonucleotides, means a sequence
having sufficient complementarity to be able to hybridize with the RNA,
preferably under moderate or high stringency conditions, forming a stable
duplex. The ability to hybridize depends on the degree of
complementarity and the length of the antisense nucleic acid. The longer
the hybridizing nucleic acid, the more base mismatches it can contain and
still form a stable duplex (or triplex, as the case may be). One skilled in
the art can ascertain a tolerable degree of mismatch by use of standard
procedures to determine the melting point of the hybridized complex.
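
By way of illustration only, one widely used rough estimate of the melting point of a short, perfectly matched oligonucleotide duplex is the Wallace rule, Tm = 2(A+T) + 4(G+C) in degrees Celsius. The sketch below assumes that rule, which is not specified in this disclosure; the function name wallace_tm is hypothetical, and the estimate is only reasonable for short oligonucleotides and does not model mismatches.

```python
def wallace_tm(oligo: str) -> int:
    """Rough melting point (deg C) of a short DNA oligo by the Wallace rule.

    Assumption: the Wallace rule is used here purely as an illustration;
    the disclosure refers only to "standard procedures".
    """
    seq = oligo.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("oligo must be a non-empty string of A, C, G, T")
    at_count = seq.count("A") + seq.count("T")
    gc_count = seq.count("G") + seq.count("C")
    # 2 degrees per A/T base pair, 4 degrees per G/C base pair
    return 2 * at_count + 4 * gc_count


if __name__ == "__main__":
    # 13-nucleotide example sequence
    print(wallace_tm("ATAACTTCGTATA"))
```

Comparing such an estimate for a perfectly matched duplex with the measured melting point of a mismatched duplex is one way the destabilizing effect of mismatches can be gauged.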
As used herein, regulatory molecule refers to a polymer of 
deoxyribonucleic acid (DNA) or ribonucleic acid (RNA) or a polypeptide 
that is capable of enhancing or inhibiting expression of a gene. 

As used herein, recognition sequences are particular sequences of
nucleotides that a protein, DNA, or RNA molecule, or combinations
thereof, (such as, but not limited to, a restriction endonuclease, a
modification methylase and a recombinase) recognizes and binds. For
example, a recognition sequence for Cre recombinase (see, e.g., SEQ ID
NO:58) is a 34 base pair sequence containing two 13 base pair inverted
repeats (serving as the recombinase binding sites) flanking an 8 base pair
core and designated loxP (see, e.g., Sauer (1994) Current Opinion in
Biotechnology 5:521-527). Other examples of recognition sequences
include, but are not limited to, attB and attP, attR and attL and others
(see, e.g., SEQ ID Nos. 8, 41-56 and 72), that are recognized by the
recombinase enzyme Integrase (see, SEQ ID Nos. 37 and 38 for the
nucleotide and encoded amino acid sequences of an exemplary lambda
phage integrase).

The recombination site designated attB is an approximately 33 base
pair sequence containing two 9 base pair core-type Int binding sites and a
7 base pair overlap region; attP (SEQ ID No. 72) is an approximately 240
base pair sequence containing core-type Int binding sites and arm-type Int
binding sites as well as sites for auxiliary proteins IHF, FIS, and Xis (see,
e.g., Landy (1993) Current Opinion in Biotechnology 3:699-707; see,
e.g., SEQ ID Nos. 8 and 72).

As used herein, a recombinase is an enzyme that catalyzes the 
exchange of DNA segments at specific recombination sites. An integrase 
herein refers to a recombinase that is a member of the lambda (λ)
integrase family. 

As used herein, recombination proteins include excisive proteins,
integrative proteins, enzymes, co-factors and associated proteins that are
involved in recombination reactions using one or more recombination sites
(see, Landy (1993) Current Opinion in Biotechnology 3:699-707). The
recombination proteins used herein can be delivered to a cell via an
expression cassette on an appropriate vector, such as a plasmid, and the
like. In other embodiments, the recombination proteins can be delivered
to a cell in protein form in the same reaction mixture used to deliver the
desired nucleic acid, such as a platform ACes, donor target vectors, and
the like.
As used herein, the expression "lox site" means a sequence of
nucleotides at which the gene product of the cre gene, referred to
herein as Cre, can catalyze a site-specific recombination event. A LoxP
site is a 34 base pair nucleotide sequence from bacteriophage P1 (see,
e.g., Hoess et al. (1982) Proc. Natl. Acad. Sci. U.S.A. 79:3398-3402).
The LoxP site contains two 13 base pair inverted repeats separated by an
8 base pair spacer region as follows (SEQ ID NO. 57):

ATAACTTCGTATA ATGTATGC TATACGAAGTTAT

E. coli DH5Δlac and yeast strain BSY23 transformed with plasmid pBS44
carrying two loxP sites connected with a LEU2 gene are available from
the American Type Culture Collection (ATCC) under accession numbers
ATCC 53254 and ATCC 20773, respectively. The lox sites can be
isolated from plasmid pBS44 with restriction enzymes EcoRI and SalI, or
XhoI and BamHI. In addition, a preselected DNA segment can be inserted
into pBS44 at either the SalI or BamHI restriction enzyme sites. Other lox
sites include, but are not limited to, LoxB, LoxL, LoxC2 and LoxR sites,
which are nucleotide sequences isolated from E. coli (see, e.g., Hoess et
al. (1982) Proc. Natl. Acad. Sci. U.S.A. 79:3398). Lox sites can also be
produced by a variety of synthetic techniques (see, e.g., Ito et al. (1982)
Nuc. Acid Res. 10:1755 and Ogilvie et al. (1981) Science 214:270).
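
The structure of the LoxP site described above (two 13 base pair inverted repeats flanking an 8 base pair spacer) can be checked directly from the sequence. The following is a minimal sketch; the helper name reverse_complement and the variable names are hypothetical and chosen only for this illustration.

```python
# LoxP sequence as set forth above: 13 bp repeat, 8 bp spacer, 13 bp repeat
LOXP = "ATAACTTCGTATA" + "ATGTATGC" + "TATACGAAGTTAT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


left_repeat = LOXP[:13]
spacer = LOXP[13:21]
right_repeat = LOXP[21:]

# The two 13 bp arms are inverted repeats of one another.
arms_are_inverted_repeats = right_repeat == reverse_complement(left_repeat)

# The 8 bp spacer is not its own reverse complement, which is what gives
# the site as a whole a direction.
spacer_is_asymmetric = spacer != reverse_complement(spacer)

if __name__ == "__main__":
    print(len(LOXP), arms_are_inverted_repeats, spacer_is_asymmetric)
```

Because the spacer is asymmetric, two lox sites on the same molecule can be assigned the same or opposite relative orientations.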

As used herein, the expression "cre gene" means a sequence of
nucleotides that encodes a gene product that effects site-specific
recombination of DNA in eukaryotic cells at lox sites. One cre gene can
be isolated from bacteriophage P1 (see, e.g., Abremski et al. (1983) Cell
32:1301-1311). E. coli DH1 and yeast strain BSY90 transformed with
plasmid pBS39 carrying a cre gene isolated from bacteriophage P1 and a
GAL1 regulatory nucleotide sequence are available from the American
Type Culture Collection (ATCC) under accession numbers ATCC 53255
and ATCC 20112, respectively. The cre gene can be isolated from
plasmid pBS39 with restriction enzymes XhoI and SalI.

As used herein, site-specific recombination refers to site-specific
recombination that is effected between two specific sites on a single
nucleic acid molecule or between two different molecules that requires
the presence of an exogenous protein, such as an integrase or
recombinase.

For example, Cre-lox site-specific recombination can include the
following three events:

a. deletion of a pre-selected DNA segment flanked by lox
sites;

b. inversion of the nucleotide sequence of a pre-selected
DNA segment flanked by lox sites; and

c. reciprocal exchange of DNA segments proximate to
lox sites located on different DNA molecules.

This reciprocal exchange of DNA segments can result in an
integration event if one or both of the DNA molecules are circular. DNA
segment refers to a linear fragment of single- or double-stranded
deoxyribonucleic acid (DNA), which can be derived from any source.
Since the lox site is an asymmetrical nucleotide sequence, two lox sites
on the same DNA molecule can have the same or opposite orientations
with respect to each other. Recombination between lox sites in the same
orientation results in a deletion of the DNA segment located between the
two lox sites and a connection between the resulting ends of the original
DNA molecule. The deleted DNA segment forms a circular molecule of
DNA. The original DNA molecule and the resulting circular molecule each
contain a single lox site. Recombination between lox sites in opposite
orientations on the same DNA molecule results in an inversion of the
nucleotide sequence of the DNA segment located between the two lox
sites. In addition, reciprocal exchange of DNA segments proximate to lox
sites located on two different DNA molecules can occur. All of these
recombination events are catalyzed by the gene product of the cre gene.
Thus, the Cre-lox system can be used to specifically delete, invert, or
insert DNA. The precise event is controlled by the orientation of lox DNA
sequences: in cis, the lox sequences direct the Cre recombinase to either
delete (lox sequences in direct orientation) or invert (lox sequences in
inverted orientation) DNA flanked by the sequences, while in trans the lox
sequences can direct a homologous recombination event resulting in the
insertion of a recombinant DNA.
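
The dependence of the outcome on lox site orientation can be sketched as a simple sequence-level model. The code below is an illustration under stated assumptions, not a description of the enzymatic mechanism: DNA molecules are plain strings, a circular molecule is written as a linear string read once around the circle, and the function names (find_lox_sites, cre_in_cis, cre_in_trans) are hypothetical.

```python
LOXP = "ATAACTTCGTATAATGTATGCTATACGAAGTTAT"  # 34 bp LoxP site
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


LOXP_RC = revcomp(LOXP)  # the same site in the opposite orientation
SITE_LEN = len(LOXP)


def find_lox_sites(dna):
    """Return (position, orientation) for each LoxP site; +1 direct, -1 inverted."""
    sites = []
    for i in range(len(dna) - SITE_LEN + 1):
        window = dna[i:i + SITE_LEN]
        if window == LOXP:
            sites.append((i, +1))
        elif window == LOXP_RC:
            sites.append((i, -1))
    return sites


def cre_in_cis(dna):
    """Model recombination between exactly two lox sites on one linear molecule.

    Returns (event, product, excised_circle); excised_circle is None for inversion.
    """
    sites = find_lox_sites(dna)
    if len(sites) != 2:
        raise ValueError("expected exactly two lox sites")
    (i, orient_i), (j, orient_j) = sites
    if orient_i == orient_j:
        # Same orientation: the flanked segment leaves as a circle with one lox site.
        product = dna[:i + SITE_LEN] + dna[j + SITE_LEN:]
        circle = dna[i + SITE_LEN:j + SITE_LEN]
        return "deletion", product, circle
    # Opposite orientation: the flanked segment is inverted in place.
    product = dna[:i + SITE_LEN] + revcomp(dna[i + SITE_LEN:j]) + dna[j:]
    return "inversion", product, None


def cre_in_trans(target, donor_circle):
    """Model insertion of a circular donor (one direct lox site) at a target lox site."""
    sites = find_lox_sites(target)
    if len(sites) != 1 or sites[0][1] != +1:
        raise ValueError("target must carry one lox site in direct orientation")
    i = sites[0][0]
    tripled = donor_circle * 3  # lets a site that wraps the circle be found
    k = tripled.find(LOXP)
    if k < 0:
        raise ValueError("donor circle must carry a direct lox site")
    # Rotate the circle so that it reads: cargo followed by its lox site.
    rotated = tripled[k + SITE_LEN:k + SITE_LEN + len(donor_circle)]
    return target[:i + SITE_LEN] + rotated + target[i + SITE_LEN:]


if __name__ == "__main__":
    substrate = "AAAA" + LOXP + "GATTACA" + LOXP + "CCCC"
    event, product, circle = cre_in_cis(substrate)
    print(event)
    # Re-inserting the excised circle regenerates the starting molecule.
    print(cre_in_trans(product, circle) == substrate)
```

In this model, insertion in trans is the reverse of deletion in cis, which mirrors the statement above that a reciprocal exchange results in integration when one of the molecules is circular.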

As used herein, a chromosome is a nucleic acid molecule, and
associated proteins, that is capable of replication and segregation within a
cell upon cell division. Typically, a chromosome contains a centromeric
region, replication origins, telomeric regions and a region of nucleic acid
between the centromeric and telomeric regions.

As used herein, a centromere is any nucleic acid sequence that
confers an ability to segregate to daughter cells through cell division. A
centromere may confer stable segregation of a nucleic acid sequence,
including an artificial chromosome containing the centromere, through
mitotic or meiotic divisions, including through both mitotic and meiotic
divisions. A particular centromere is not necessarily derived from the
same species in which it is introduced, but has the ability to promote
DNA segregation in cells of that species.

As used herein, euchromatin and heterochromatin have their
recognized meanings. Euchromatin refers to chromatin that stains
diffusely and that typically contains genes, and heterochromatin refers to
chromatin that remains unusually condensed and that has been thought to
be transcriptionally inactive. Highly repetitive DNA sequences (satellite
DNA) are usually located in regions of the heterochromatin surrounding
the centromere (pericentric or pericentromeric heterochromatin).
Constitutive heterochromatin refers to heterochromatin that contains the
highly repetitive DNA which is constitutively condensed and genetically
inactive.

As used herein, an acrocentric chromosome refers to a
chromosome with arms of unequal length.

As used herein, endogenous chromosomes refer to genomic
chromosomes as found in a cell prior to generation or introduction of an
artificial chromosome.

As used herein, artificial chromosomes are nucleic acid molecules,
typically DNA, that stably replicate and segregate alongside endogenous
chromosomes in cells and have the capacity to accommodate and express
heterologous genes contained therein. They have the capacity to act as
gene delivery vehicles by accommodating and expressing foreign genes
contained therein. A mammalian artificial chromosome (MAC) refers to
chromosomes that have an active mammalian centromere(s). Plant
artificial chromosomes, insect artificial chromosomes and avian artificial
chromosomes refer to chromosomes that include centromeres that
function in plant, insect and avian cells, respectively. A human artificial
chromosome (HAC) refers to chromosomes that include centromeres that
function in human cells. For exemplary artificial chromosomes, see, e.g.,
U.S. Patent Nos. 6,025,155; 6,077,697; 5,288,625; 5,712,134;
5,695,967; 5,869,294; 5,891,691 and 5,721,118 and published
International PCT application Nos. WO 97/40183 and WO 98/08964.

Artificial chromosomes include those that are predominantly
heterochromatic (formerly referred to as satellite artificial chromosomes
(SATACs); see, e.g., U.S. Patent Nos. 6,077,697 and 6,025,155 and
published International PCT application No. WO 97/40183),
minichromosomes that contain a de novo centromere (see, U.S. Patent
Nos. 5,712,134, 5,891,691 and 5,288,625), artificial chromosomes
predominantly made up of repeating nucleic acid units and that contain
substantially equivalent amounts of euchromatic and heterochromatic
DNA and in vitro assembled artificial chromosomes (see, copending U.S.
provisional application Serial No. 60/294,687, filed on May 30, 2001).

As used herein, the term "satellite DNA-based artificial
chromosome (SATAC)" is interchangeable with the term "artificial
chromosome expression system (ACes)". These artificial chromosomes
(ACes) include those that are substantially all neutral non-coding
sequences (heterochromatin) except for foreign heterologous, typically
gene-encoding nucleic acid, that is interspersed within the
heterochromatin for the expression therein (see U.S. Patent Nos.
6,025,155 and 6,077,697 and International PCT application No. WO
97/40183), or that is in a single locus as provided herein. Also included
are ACes that may include euchromatin and that result from the process
described in U.S. Patent Nos. 6,025,155 and 6,077,697 and International
PCT application No. WO 97/40183 and outlined herein. The delineating
structural feature is the presence of repeating units, that are generally
predominantly heterochromatin. The precise structure of the ACes will
depend upon the structure of the chromosome in which the initial
amplification event occurs; all share the common feature of including a
defined pattern of repeating units. Generally ACes have more
heterochromatin than euchromatin. Foreign nucleic acid molecules
(heterologous genes) contained in these artificial chromosome expression
systems can include any nucleic acid whose expression is of interest in a
particular host cell. Such foreign nucleic acid molecules include, but are
not limited to, nucleic acid that encodes traceable marker proteins
(reporter genes), such as fluorescent proteins, such as green, blue or red
fluorescent proteins (GFP, BFP and RFP, respectively), other reporter
genes, such as β-galactosidase and proteins that confer drug resistance,
such as a gene encoding hygromycin-resistance. Other examples of
heterologous nucleic acid molecules include, but are not limited to, DNA
that encodes therapeutically effective substances, such as anti-cancer
agents, enzymes and hormones, DNA that encodes other types of
proteins, such as antibodies, and DNA that encodes RNA molecules (such
as antisense or siRNA molecules) that are not translated into proteins.
As used herein, an artificial chromosome platform, also referred to
herein as a "platform ACes" or "ACes platform", refers to an artificial
chromosome that has been engineered to include one or more sites for
site-specific, recombination-directed integration. In particular, ACes that
are so-engineered are provided. Any sites, including but not limited to
any described herein, that are suitable for such integration are
contemplated. Plant and animal platform ACes are provided. Among the
ACes contemplated herein are those that are predominantly
heterochromatic (formerly referred to as satellite artificial chromosomes
(SATACs); see, e.g., U.S. Patent Nos. 6,077,697 and 6,025,155 and
published International PCT application No. WO 97/40183), and artificial
chromosomes predominantly made up of repeating nucleic acid units and
that contain substantially equivalent amounts of euchromatic and
heterochromatic DNA resulting from an amplification event depicted in the
referenced patent and herein. Included among the ACes for use in
generating platforms are artificial chromosomes that introduce and
express heterologous nucleic acids in plants (see copending U.S.
provisional application Serial No. 60/294,687, filed on May 30, 2001).
These include artificial chromosomes that have a centromere derived from 
a plant, and, also, artificial chromosomes that have centromeres that may 
be derived from other organisms but that function in plants.

As used herein, a "reporter ACes" refers to an ACes that comprises
one or a plurality of reporter constructs, where the reporter construct
comprises a reporter gene in operative linkage with a regulatory region
responsive to test or known compounds.

As used herein, amplification, with reference to DNA, is a process
in which segments of DNA are duplicated to yield two or multiple copies
of substantially similar or identical or nearly identical DNA segments that
are typically joined as substantially tandem or successive repeats or
inverted repeats.

As used herein, amplification-based artificial chromosomes are
artificial chromosomes derived from natural or endogenous chromosomes
by virtue of an amplification event, such as one initiated by introduction
of heterologous nucleic acid into rDNA in a chromosome. As a result of
such an event, chromosomes and fragments thereof exhibiting segmented
or repeating patterns arise. Artificial chromosomes can be formed from
these chromosomes and fragments. Hence, amplification-based artificial
chromosomes refer to engineered chromosomes that exhibit an ordered
segmentation that is not observed in naturally occurring chromosomes
and that distinguishes them from naturally occurring chromosomes. The
segmentation, which can be visualized using a variety of chromosome
analysis techniques known to those of skill in the art, correlates with the
structure of these artificial chromosomes. In addition to containing one or
more centromeres, the amplification-based artificial chromosomes,
throughout the region or regions of segmentation, are predominantly made
up of nucleic acid units, also referred to as "amplicons", that is (are)
repeated in the region and that have a similar gross structure. Repeats of
an amplicon tend to be of similar size and share some common nucleic
acid sequences. For example, each repeat of an amplicon may contain a
replication site involved in amplification of chromosome segments and/or
some heterologous nucleic acid that was utilized in the initial production
of the artificial chromosome. Typically, the repeating units are
substantially similar in nucleic acid composition and may be nearly
identical.

The amplification-based artificial chromosomes differ depending on
the chromosomal region that has undergone amplification in the process
of artificial chromosome formation. The structures of the resulting
chromosomes can vary depending upon the initiating event and/or the
conditions under which the heterologous nucleic acid is introduced,
including modification to the endogenous chromosomes. For example, in
some of the artificial chromosomes provided herein, the region or regions
of segmentation may be made up predominantly of heterochromatic DNA.
In other artificial chromosomes provided herein, the region or regions of
segmentation may be made up predominantly of euchromatic DNA or may
be made up of similar amounts of heterochromatic and euchromatic DNA.

As used herein, an amplicon is a repeated nucleic acid unit. In
some of the artificial chromosomes described herein, an amplicon may
contain a set of inverted repeats of a megareplicon. A megareplicon
represents a higher order replication unit. For example, with reference to
some of the predominantly heterochromatic artificial chromosomes, the
megareplicon can contain a set of tandem DNA blocks (e.g., ~7.5 Mb
DNA blocks) each containing satellite DNA flanked by non-satellite DNA
or may be made up of substantially rDNA. Contained within the
megareplicon is a primary replication site, referred to as the
megareplicator, which may be involved in organizing and facilitating
replication of the pericentric heterochromatin and possibly the
centromeres. Within the megareplicon there may be smaller (e.g., 50-300
kb) secondary replicons.

In artificial chromosomes, such as those provided in U.S. Patent Nos.
6,025,155 and 6,077,697 and International PCT application No. WO
97/40183, the megareplicon is defined by two tandem blocks (~7.5 Mb
DNA blocks in the chromosomes provided therein). Within each artificial
chromosome or among a population thereof, each amplicon has the same
gross structure but may contain sequence variations. Such variations will
arise as a result of movement of mobile genetic elements, deletions or
insertions or mutations that arise, particularly in culture. Such variation
does not affect the use of the artificial chromosomes or their overall
structure as described herein.

As used herein, amplifiable, when used in reference to a
chromosome, particularly in the method of generating artificial chromosomes
provided herein, refers to a region of a chromosome that is prone to
amplification. Amplification typically occurs during replication and other
cellular events involving recombination (e.g., DNA repair). Such regions
include regions of the chromosome that contain tandem repeats, such as
satellite DNA, rDNA, and other such sequences.

As used herein, a dicentric chromosome is a chromosome that
contains two centromeres. A multicentric chromosome contains more
than two centromeres.

As used herein, a formerly dicentric chromosome is a chromosome
that is produced when a dicentric chromosome fragments and acquires
new telomeres so that two chromosomes, each having one of the
centromeres, are produced. Each of the fragments is a replicable
chromosome. If one of the chromosomes undergoes amplification of
primarily euchromatic DNA to produce a fully functional chromosome that
is predominantly (at least more than 50%) euchromatin, it is a
minichromosome. The remaining chromosome is a formerly dicentric
chromosome. If one of the chromosomes undergoes amplification,
whereby heterochromatin (such as, for example, satellite DNA) is
amplified and a euchromatic portion (such as, for example, an arm)
remains, it is referred to as a sausage chromosome. A chromosome that
is substantially all heterochromatin, except for portions of heterologous
DNA, is called a predominantly heterochromatic artificial chromosome.
Predominantly heterochromatic artificial chromosomes can be produced
from other partially heterochromatic artificial chromosomes by culturing
the cell containing such chromosomes under conditions that destabilize
the chromosome, such as BrdU treatment, and/or growth under selective
conditions so that a predominantly heterochromatic artificial chromosome
is produced. For purposes herein, it is understood that the artificial
chromosomes may not necessarily be produced in multiple steps, but may
appear after the initial introduction of the heterologous DNA. Typically,
artificial chromosomes appear after about 5 to about 60, or about 5 to
about 55, or about 10 to about 55, or about 25 to about 55, or about 35
to about 55 cell doublings after initiation of artificial chromosome
generation, or they may appear after several cycles of growth under
selective conditions and BrdU treatment.

As used herein, an artificial chromosome that is predominantly
heterochromatic (i.e., containing more heterochromatin than euchromatin,
typically more than about 50%, more than about 70%, or more than
about 90% heterochromatin) may be produced by introducing nucleic acid
molecules into cells, such as, for example, animal or plant cells, and
selecting cells that contain a predominantly heterochromatic artificial
chromosome. Any nucleic acid may be introduced into cells in such
methods of producing the artificial chromosomes. For example, the
nucleic acid may contain a selectable marker and/or optionally a sequence
that targets nucleic acid to the pericentric, heterochromatic region of a
chromosome, such as in the short arm of acrocentric chromosomes and
nucleolar organizing regions. Targeting sequences include, but are not
limited to, lambda phage DNA and rDNA for production of predominantly
heterochromatic artificial chromosomes in eukaryotic cells.
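
The heterochromatin thresholds recited above (more than about 50%, 70% or 90%) can be illustrated with a minimal sketch. The function name, the use of megabase amounts as inputs, and the treatment of each "about" threshold as a hard cutoff are assumptions made only for illustration; they are not part of the definition.

```python
def heterochromatin_tier(heterochromatin_mb: float, euchromatin_mb: float) -> str:
    """Report the highest recited threshold exceeded by the heterochromatin fraction.

    Hypothetical helper: the thresholds (50%, 70%, 90%) come from the text,
    but treating "about" as an exact cutoff is a simplifying assumption.
    """
    total = heterochromatin_mb + euchromatin_mb
    if heterochromatin_mb < 0 or euchromatin_mb < 0 or total <= 0:
        raise ValueError("amounts must be non-negative and sum to a positive total")
    fraction = heterochromatin_mb / total
    # Check the strictest threshold first so the highest tier is reported.
    for threshold, label in ((0.90, ">90%"), (0.70, ">70%"), (0.50, ">50%")):
        if fraction > threshold:
            return label
    return "not predominantly heterochromatic"
```

Under these assumptions, a chromosome with equal amounts of heterochromatin and euchromatin does not qualify, because the fraction must exceed 50%.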

After introducing the nucleic acid into cells, a cell containing a
predominantly heterochromatic artificial chromosome is selected. Such
cells may be identified using a variety of procedures. For example,
repeating units of heterochromatic DNA of these chromosomes may be
discerned by G-banding and/or fluorescence in situ hybridization (FISH)
techniques. Prior to such analyses, the cells to be analyzed may be
enriched with artificial chromosome-containing cells by sorting the cells
on the basis of the presence of a selectable marker, such as a reporter
protein, or by growing (culturing) the cells under selective conditions. It
is also possible, after introduction of nucleic acids into cells, to select
cells that have a multicentric, typically dicentric, chromosome, a formerly
multicentric (typically dicentric) chromosome and/or various
heterochromatic structures, such as a megachromosome and a sausage
chromosome, that contain a centromere and are predominantly
heterochromatic and to treat them such that desired artificial
chromosomes are produced. Cells containing a new chromosome are
selected. Conditions for generation of a desired structure include, but are
not limited to, further growth under selective conditions, introduction of
additional nucleic acid molecules and/or growth under selective conditions
and treatment with destabilizing agents, and other such methods (see
International PCT application No. WO 97/40183 and U.S. Patent Nos.
6,025,155 and 6,077,697).

As used herein, a "selectable marker" is a nucleic acid segment,
generally DNA, that allows one to select for or against a molecule or a
cell that contains it, often under particular conditions. These markers can
encode an activity, such as, but not limited to, production of RNA,
peptide, or protein, or can provide a binding site for RNA, peptides,
proteins, inorganic and organic compounds and compositions. Examples
of selectable markers include but are not limited to: (1) nucleic acid
segments that encode products that provide resistance against otherwise
toxic compounds (e.g., antibiotics); (2) nucleic acid segments that encode
products that are otherwise lacking in the recipient cell (e.g., tRNA genes,
auxotrophic markers); (3) nucleic acid segments that encode products
that suppress the activity of a gene product; (4) nucleic acid segments
that encode products that can be identified, such as phenotypic markers,
including β-galactosidase, red, blue and/or green fluorescent proteins
(FPs), and cell surface proteins; (5) nucleic acid segments that bind
products that are otherwise detrimental to cell survival and/or function;
(6) nucleic acid segments that otherwise inhibit the activity of any of the
nucleic acid segments described in Nos. 1-5 above (e.g., antisense
oligonucleotides or siRNA molecules for use in RNA interference); (7)
nucleic acid segments that bind products that modify a substrate (e.g.,
restriction endonucleases); (8) nucleic acid segments that can be used to
isolate a desired molecule (e.g., specific protein binding sites); (9) nucleic
acid segments that encode a specific nucleotide sequence that can be
otherwise non-functional, such as for PCR amplification of subpopulations
of molecules; and/or (10) nucleic acid segments, which when absent,
directly or indirectly confer sensitivity to particular compounds. Thus, for
example, selectable markers include nucleic acids encoding fluorescent
proteins, such as green fluorescent proteins, β-galactosidase and other
readily detectable proteins, such as chromogenic proteins or proteins
capable of being bound by an antibody and FACS sorted. Selectable
markers such as these, which are not required for cell survival and/or
proliferation in the presence of a selection agent, are also referred to
herein as reporter molecules. Other selectable markers, e.g., the
siRNA encoding sequences under the regulation of a regulatable or
inducible promoter would allow one to temporally and/or spatially control 
the knockdown effect of the corresponding gene. 

As used herein, a reporter gene includes any gene that expresses a
detectable gene product, which may be RNA or protein. Generally
reporter genes are readily detectable. Examples of reporter genes include,
but are not limited to, nucleic acid encoding a fluorescent protein; CAT
(chloramphenicol acetyl transferase) (Alton et al. (1979) Nature
282:864-869); luciferase and other enzyme detection systems, such as
beta-galactosidase; firefly luciferase (deWet et al. (1987) Mol. Cell. Biol.
7:725-737); bacterial luciferase (Engebrecht and Silverman (1984) Proc.
Natl. Acad. Sci. U.S.A. 81:4154-4158; Baldwin et al. (1984)
Biochemistry 23:3663-3667); and alkaline phosphatase (Toh et al. (1989)
Eur. J. Biochem. 182:231-238, Hall et al. (1983) J. Mol. Appl. Gen.
2:101).

As used herein, growth under selective conditions means growth of 
a cell under conditions that require expression of a selectable marker for 
survival. 

As used herein, an agent that destabilizes a chromosome is any
agent known by those skilled in the art to enhance amplification events
and/or mutations. Such agents, which include BrdU, are well known to
those skilled in the art.

In order to generate an artificial chromosome containing a particular
heterologous nucleic acid of interest, it is possible to include the nucleic
acid in the nucleic acid that is being introduced into cells to initiate
production of the artificial chromosome. Thus, for example, a nucleic
acid can be introduced into a cell along with nucleic acid encoding a
selectable marker and/or a nucleic acid that targets to a heterochromatic
region of a chromosome. For introducing a heterologous nucleic acid into
the cell, it can be included in a fragment that includes a selectable marker
or as part of a separate nucleic acid fragment and introduced into the cell
with a selectable marker during the process of generating the artificial
chromosomes. Alternatively, heterologous nucleic acid can be introduced
into an artificial chromosome at a later time after the initial generation of
the artificial chromosome.

As used herein, the minichromosome refers to a chromosome
derived from a multicentric, typically dicentric, chromosome that contains
more euchromatic than heterochromatic DNA. For purposes herein, the
minichromosome contains a de novo centromere (e.g., a neocentromere).
In some embodiments, for example, the minichromosome contains a
centromere that replicates in animals, e.g., a mammalian centromere, or in
plants, e.g., a plant centromere.

As used herein, in vitro assembled artificial chromosomes or
synthetic chromosomes can be either more euchromatic than
heterochromatic or more heterochromatic than euchromatic and are
produced by joining essential components of a chromosome in vitro.
These components include at least a centromere, a megareplicator, a
telomere and optionally secondary origins of replication.

As used herein, in vitro assembled plant or animal artificial
chromosomes are produced by joining essential components (at least the
centromere, telomere(s), megareplicator and optional secondary origins of
replication) that function in plants or animals. In particular embodiments,
the megareplicator contains sequences of rDNA, particularly plant or
animal rDNA.

As used herein, a plant is a eukaryotic organism that contains, in 
addition to a nucleus and mitochondria, chloroplasts capable of carrying 
out photosynthesis. A plant can be unicellular or multicellular and can 
contain multiple tissues and/or organs. Plants can reproduce sexually or
asexually and can be perennial or annual in growth. Plants can also be
terrestrial or aquatic. The term "plant" includes a whole plant, plant cell, 
plant protoplast, plant calli, plant seed, plant organ, plant tissue, and 
other parts of a whole plant. 

As used herein, stable maintenance of chromosomes occurs when 
at least about 85%, preferably 90%, more preferably 95%, of the cells 
retain the chromosome. Stability is measured in the presence of a 
selective agent. Preferably these chromosomes are also maintained in the 
absence of a selective agent. Stable chromosomes also retain their 
structure during cell culturing, suffering no unintended intrachromosomal 
or interchromosomal rearrangements. 
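
The retention criterion for stable maintenance can be expressed as a minimal sketch. The function name, the cell-count inputs and the tier labels are hypothetical, and the "about 85%" figure is treated as an exact cutoff purely for illustration. The sketch covers only the retention percentages; it does not address the structural requirement (absence of unintended rearrangements).

```python
def stability_tier(cells_retaining: int, cells_scored: int) -> str:
    """Classify chromosome retention against the 85% / 90% / 95% levels.

    Hypothetical helper; integer arithmetic avoids floating-point edge cases
    exactly at a threshold.
    """
    if cells_scored <= 0 or not 0 <= cells_retaining <= cells_scored:
        raise ValueError("need 0 <= cells_retaining <= cells_scored and cells_scored > 0")
    if cells_retaining * 100 >= 95 * cells_scored:
        return "stable (at least 95%, more preferred)"
    if cells_retaining * 100 >= 90 * cells_scored:
        return "stable (at least 90%, preferred)"
    if cells_retaining * 100 >= 85 * cells_scored:
        return "stable (at least 85%)"
    return "not stably maintained"
```

For example, 170 retaining cells out of 200 scored is exactly 85% and meets the lowest level under this reading.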

As used herein, de novo with reference to a centromere, refers to 
generation of an excess centromere in a chromosome as a result of 
incorporation of a heterologous nucleic acid fragment using the methods 
herein. 

As used herein, BrdU refers to 5-bromodeoxyuridine, which during 
replication is inserted in place of thymidine. BrdU is used as a mutagen; it 
also inhibits condensation of metaphase chromosomes during cell 
division. 

As used herein, ribosomal RNA (rRNA) is the specialized RNA that 
forms part of the structure of a ribosome and participates in the synthesis 
of proteins. Ribosomal RNA is produced by transcription of genes which, 
in eukaryotic cells, are present in multiple copies. In human cells, the 
approximately 250 copies of rRNA genes (i.e., genes which encode rRNA) 
per haploid genome are spread out in clusters on at least five different
chromosomes (chromosomes 13, 14, 15, 21 and 22). In mouse cells, the
presence of ribosomal DNA (rDNA, which is DNA containing sequences
that encode rRNA) has been verified on at least 11 pairs out of 20 mouse
chromosomes (chromosomes 5, 6, 7, 9, 11, 12, 15, 16, 17, 18, and 19)
(see, e.g., Rowe et al. (1996) Mamm. Genome 7:886-889 and Johnson et
al. (1993) Mamm. Genome 4:49-52). In Arabidopsis thaliana, the
presence of rDNA has been verified on chromosomes 2 and 4 (18S, 5.8S,
and 25S rDNA) and on chromosomes 3, 4, and 5 (5S rDNA) (see The
Arabidopsis Genome Initiative (2000) Nature 408:796-815). In
eukaryotic cells, the multiple copies of the highly conserved rRNA genes
are located in a tandemly arranged series of rDNA units, which are
generally about 40-45 kb in length and contain a transcribed region and a
nontranscribed region known as spacer (i.e., intergenic spacer) DNA,
which can vary in length and sequence. In the human and mouse, these
tandem arrays of rDNA units are located adjacent to the pericentric
satellite DNA sequences (heterochromatin). The regions of these
chromosomes in which the rDNA is located are referred to as nucleolar
organizing regions (NOR), which loop into the nucleolus, the site of
ribosome production within the cell nucleus.

As used herein, a megachromosome refers to a chromosome that,
except for introduced heterologous DNA, is substantially composed of
heterochromatin. Megachromosomes are made up of an array of repeated
amplicons that contain two inverted megareplicons bordered by
introduced heterologous DNA (see, e.g., Figure 3 of U.S. Patent No.
6,077,697 for a schematic drawing of a megachromosome). For
purposes herein, a megachromosome is about 50 to 400 Mb, generally
about 250-400 Mb. Shorter variants are also referred to as truncated
megachromosomes (about 90 to 120 or 150 Mb), dwarf
megachromosomes (~150-200 Mb), and a micro-megachromosome
(~50-90 Mb, typically 50-60 Mb). For purposes herein, the term
megachromosome refers to the overall repeated structure based on an
array of repeated chromosomal segments (amplicons) that contain two
inverted megareplicons bordered by any inserted heterologous DNA. The
size will be specified.
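
The size ranges recited for megachromosome variants can be tabulated in a minimal sketch. The names below are hypothetical, and treating the approximate ranges as hard, inclusive boundaries (with 150 Mb as the upper end of the truncated range) is an assumption made only for illustration. Because the recited ranges touch at their ends, a size on a boundary matches two classes, and a size between 200 and 250 Mb falls within the overall 50 to 400 Mb range without matching any named variant.

```python
# (label, lower bound in Mb, upper bound in Mb); bounds taken from the text,
# interpreted as inclusive hard limits for the purpose of this sketch only.
SIZE_CLASSES_MB = [
    ("micro-megachromosome", 50, 90),
    ("truncated megachromosome", 90, 150),
    ("dwarf megachromosome", 150, 200),
    ("megachromosome (typical size)", 250, 400),
]


def size_classes(size_mb: float) -> list[str]:
    """Return every recited size class whose range contains size_mb."""
    return [name for name, low, high in SIZE_CLASSES_MB if low <= size_mb <= high]
```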

As used herein, gene therapy involves the transfer or insertion of
nucleic acid molecules into certain cells, which are also referred to as
target cells, to produce specific products that are involved in preventing,
curing, correcting, controlling or modulating diseases, disorders and
deleterious conditions. The nucleic acid is introduced into the selected
target cells in a manner such that the nucleic acid is expressed and a
product encoded thereby is produced. Alternatively, the nucleic acid may
in some manner mediate expression of DNA that encodes a therapeutic
product. This product may be a therapeutic compound, which is
produced in therapeutically effective amounts or at a therapeutically
useful time. It may also encode a product, such as a peptide or RNA,
that in some manner mediates, directly or indirectly, expression of a
therapeutic product. Expression of the nucleic acid by the target cells
within an organism afflicted with a disease or disorder thereby provides
for modulation of the disease or disorder. The nucleic acid encoding the
therapeutic product may be modified prior to introduction into the cells of
the afflicted host in order to enhance or otherwise alter the product or
expression thereof.

For use in gene therapy, cells can be transfected in vitro, followed
by introduction of the transfected cells into an organism. This is often
referred to as ex vivo gene therapy. Alternatively, the cells can be
transfected directly in vivo within an organism.

As used herein, therapeutic agents include, but are not limited to,
growth factors, antibodies, cytokines, such as tumor necrosis factors and
interleukins, and cytotoxic agents and other agents disclosed herein and
known to those of skill in the art. Such agents include, but are not
limited to, tumor necrosis factor, α-interferon, β-interferon, nerve growth
factor, platelet derived growth factor, tissue plasminogen activator; or
biological response modifiers such as, for example, lymphokines,
interleukin-1 (IL-1), interleukin-2 (IL-2), interleukin-6 (IL-6), granulocyte
macrophage colony stimulating factor (GM-CSF), granulocyte colony
stimulating factor (G-CSF), erythropoietin (EPO), pro-coagulants such as
tissue factor and tissue factor variants, pro-apoptotic agents such as
FAS-ligand, fibroblast growth factors (FGF), nerve growth factor and other
growth factors.

As used herein, a therapeutically effective product is a product that
is encoded by heterologous DNA and that, upon introduction of the DNA
into a host, is expressed and effectively ameliorates or eliminates
the symptoms or manifestations of an inherited or acquired disease, or
cures the disease.

As used herein, transgenic plants and animals refer to plants and 
animals in which heterologous or foreign nucleic acid is expressed or in 
which the expression of a gene naturally present in the plant or animal 
has been altered by virtue of introduction of heterologous or foreign
nucleic acid.

As used herein, IRES (internal ribosome entry site; see, e.g., SEQ
ID No. 27 and nucleotides 2736-3308 of SEQ ID No. 28) refers to a region
of a nucleic acid molecule, such as an mRNA molecule, that allows
internal ribosome entry sufficient to initiate translation, which initiation
can be detected in an assay for cap-independent translation (see, e.g.,
U.S. Patent No. 6,171,821). The presence of an IRES within an mRNA
molecule allows cap-independent translation of a linked protein-encoding
sequence that otherwise would not be translated.

Internal ribosome entry site (IRES) elements were first identified in
picornaviruses, which elements are considered the paradigm for
cap-independent translation. The 5' UTRs of all picornaviruses are long and
mediate translational initiation by directly recruiting and binding
ribosomes, thereby circumventing the initial cap-binding step. IRES
elements are frequently found in viral mRNA; they are rare in non-viral
mRNA. Among non-viral mRNA molecules that contain functional IRES
elements in their respective 5' UTRs are those encoding immunoglobulin
heavy chain binding protein (BiP) (Macejak et al. (1991) Nature
353:90-94); Drosophila Antennapedia (Oh et al. (1992) Genes Dev.
6:1643-1653); D. Ultrabithorax (Ye et al. (1997) Mol. Cell Biol.
17:1714-21); fibroblast growth factor 2 (Vagner et al. (1995) Mol. Cell
Biol. 15:35-44); initiation factor eIF4G (Gan et al. (1998) J. Biol. Chem.
273:5006-5012); proto-oncogene c-myc (Nanbru et al. (1995) J. Biol.
Chem. 272:32061-32066; Stoneley (1998) Oncogene 16:423-428);
IRESh, from the 5' UTR of the NRF1 gene (Oumard et al. (2000) Mol. and Cell
Biol. 20(8):2755-2759); and vascular endothelial growth factor (VEGF)
(Stein et al. (1998) Mol. Cell Biol. 18:3112-9).

As used herein, a promoter, with respect to a region of DNA, refers 
to a sequence of DNA that contains a sequence of bases that signals RNA 
polymerase to associate with the DNA and initiate transcription of RNA 
(such as pol II for mRNA) from a template strand of the DNA. A promoter 
thus generally regulates transcription of DNA into mRNA. A particular 
promoter provided herein is the Ferritin heavy chain promoter (excluding 
the Iron Response Element, located in the 5'UTR), which was joined to 
the 37 bp Fer-1 enhancer element. This promoter is set forth as SEQ ID
NO:128. The endogenous Fer-1 enhancer element is located upstream of
the Fer-1 promoter (e.g., a Fer-1 oligo was cloned proximal to the core 
promoter).

As used herein, isolated, substantially pure nucleic acid, such as,
for example, DNA, refers to nucleic acid fragments purified according to
standard techniques employed by those skilled in the art, such as that
found in Sambrook et al. ((2001) Molecular Cloning: A Laboratory
Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY,
3rd edition).

As used herein, expression refers to the transcription and/or
translation of nucleic acid. For example, expression can be the
transcription of a gene into an RNA molecule,
such as a messenger RNA (mRNA) molecule. Expression may further
include translation of an RNA molecule into peptides,
polypeptides, or proteins. If the nucleic acid is derived from genomic
DNA, expression may, if an appropriate eukaryotic host cell or organism is
selected, include splicing of the mRNA. With respect to an antisense
construct, expression may refer to the transcription of the antisense DNA.

As used herein, vector or plasmid refers to discrete elements that
are used to introduce heterologous nucleic acids into cells for either
expression of the heterologous nucleic acid or for replication of the
heterologous nucleic acid. Selection and use of such vectors and
plasmids are well within the level of skill of the art.

As used herein, transformation/transfection refers to the process by
which nucleic acid is introduced into cells. The terms transfection and
transformation refer to the taking up of exogenous nucleic acid, e.g., an
expression vector, by a host cell whether or not any coding sequences
are in fact expressed. Numerous methods of transfection are known to
the ordinarily skilled artisan, for example, Agrobacterium-mediated
transformation, protoplast transformation (including polyethylene glycol
(PEG)-mediated transformation, electroporation, protoplast fusion, and
microcell fusion), lipid-mediated delivery, liposomes, electroporation,
sonoporation, microinjection, particle bombardment and silicon carbide
whisker-mediated transformation and combinations thereof (see, e.g.,
Paszkowski et al. (1984) EMBO J. 3:2717-2722; Potrykus et al. (1985)
Mol. Gen. Genet. 199:169-177; Reich et al. (1986) Biotechnology
4:1001-1004; Klein et al. (1987) Nature 327:70-73; U.S. Patent No.
6,143,949; Paszkowski et al. (1989) in Cell Culture and Somatic Cell
Genetics of Plants, Vol. 6, Molecular Biology of Plant Nuclear Genes, eds.
Schell, J. and Vasil, L.K., Academic Publishers, San Diego, California, p.
52-68; and Frame et al. (1994) Plant J. 6:941-948), direct uptake using
calcium phosphate (CaPO4; see, e.g., Wigler et al. (1979) Proc. Natl.
Acad. Sci. U.S.A. 76:1373-1376), polyethylene glycol (PEG)-mediated
DNA uptake, lipofection (see, e.g., Strauss (1996) Meth. Mol. Biol.
54:307-327), microcell fusion (see EXAMPLES; see also Lambert (1991)
Proc. Natl. Acad. Sci. U.S.A. 88:5907-5911; U.S. Patent No. 5,396,767;
Sawford et al. (1987) Somatic Cell Mol. Genet. 13:279-284; Dhar et al.
(1984) Somatic Cell Mol. Genet. 10:547-559; and McNeill-Killary et al.
(1995) Meth. Enzymol. 254:133-152), lipid-mediated carrier systems
(see, e.g., Teifel et al. (1995) Biotechniques 19:79-80; Albrecht et al.
(1996) Ann. Hematol. 72:73-79; Holmen et al. (1995) In Vitro Cell Dev.
Biol. Anim. 31:347-351; Remy et al. (1994) Bioconjug. Chem.
5:647-654; Le Bolch et al. (1995) Tetrahedron Lett. 36:6681-6684; Loeffler
et al. (1993) Meth. Enzymol. 217:599-618) or other suitable method.
Methods for delivery of ACes are described in copending U.S. application
Serial No. 09/815,979. Successful transfection is generally recognized
by detection of the presence of the heterologous nucleic acid within the
transfected cell, such as, for example, any visualization of the
heterologous nucleic acid or any indication of the operation of a vector
within the host cell.

As used herein, "delivery," which is used interchangeably with
"transfection," refers to the process by which exogenous nucleic acid
molecules are transferred into a cell such that they are located inside the
cell. Delivery of nucleic acids is a distinct process from expression of
nucleic acids.

As used herein, injected refers to the microinjection, such as by 
use of a small syringe, needle, or pipette, for injection of nucleic acid into 
a cell. 

As used herein, substantially homologous DNA refers to DNA that
includes a sequence of nucleotides that is sufficiently similar to another
such sequence to form stable hybrids, with each other or a reference
sequence, under specified conditions.

It is well known to those of skill in this art that nucleic acid
fragments with different sequences may, under the same conditions,
hybridize detectably to the same "target" nucleic acid. Two nucleic acid
fragments hybridize detectably, under stringent conditions over a
sufficiently long hybridization period, because one fragment contains a
segment of at least about 10, 14 or 16 or more nucleotides in a sequence
that is complementary (or nearly complementary) to a substantially
contiguous sequence of at least one segment in the other nucleic acid
fragment. If the time during which hybridization is allowed to occur is
held constant, at a value during which, under preselected stringency
conditions, two nucleic acid fragments with complementary base-pairing
segments hybridize detectably to each other, departures from exact
complementarity can be introduced into the base-pairing segments, and
base-pairing will nonetheless occur to an extent sufficient to make
hybridization detectable. As the departure from complementarity between
the base-pairing segments of two nucleic acids becomes larger, and as
conditions of the hybridization become more stringent, the probability
decreases that the two segments will hybridize detectably to each other.

Two single-stranded nucleic acid segments have "substantially the 
same sequence", if (a) both form a base-paired duplex with the same 
segment, and (b) the melting temperatures of the two duplexes in a 
solution of 0.5 X SSPE differ by less than 10°C. If the segments being 
compared have the same number of bases, then to have "substantially 
the same sequence", they will typically differ in their sequences at fewer 
than 1 base in 10. Methods for determining melting temperatures of 
nucleic acid duplexes are well known (see, e.g., Meinkoth et al. (1984) 
Anal. Biochem. 138:267-284 and references cited therein). 
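
By way of illustration only, the two tests in the preceding paragraph can 
be sketched in Python as follows. The function names are hypothetical and 
are not part of this disclosure, and the melting temperatures are assumed 
to have been measured separately in 0.5 X SSPE; the sketch does not 
compute them. 

```python
def tm_criterion_met(tm_first_c, tm_second_c, max_delta_c=10.0):
    """Criterion (b): the two duplex melting temperatures differ by < 10 deg C."""
    return abs(tm_first_c - tm_second_c) < max_delta_c


def mismatch_count(segment_a, segment_b):
    """Count the positions at which two equal-length segments differ."""
    if len(segment_a) != len(segment_b):
        raise ValueError("segments must have the same number of bases")
    return sum(1 for a, b in zip(segment_a.upper(), segment_b.upper()) if a != b)


def within_typical_mismatch_limit(segment_a, segment_b):
    """Typical expectation for equal-length segments: fewer than 1 base in 10 differs."""
    # Integer comparison avoids floating-point rounding at the boundary.
    return mismatch_count(segment_a, segment_b) * 10 < len(segment_a)
```

Under these assumptions, a single difference in a 20-base segment falls 
within the typical limit, whereas a single difference in a 10-base 
segment does not, because 1 in 10 is not fewer than 1 in 10. 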

As used herein, a nucleic acid probe is a DNA or RNA fragment 
that includes a sufficient number of nucleotides to specifically hybridize to 
DNA or RNA that includes complementary or substantially complementary 
sequences of nucleotides. A probe may contain any number of 
nucleotides, from as few as about 10 to as many as hundreds of 
thousands of nucleotides. The conditions and protocols for such 
hybridization reactions are well known to those of skill in the art as are 
the effects of probe size, temperature, degree of mismatch, salt 
concentration and other parameters on the hybridization reaction. For 
example, the lower the temperature and higher the salt concentration at 
which the hybridization reaction is carried out, the greater the degree of 
mismatch that may be present in the hybrid molecules. 

To be used as a hybridization probe, the nucleic acid is generally 
rendered detectable by labeling it with a detectable moiety or label, such 
as ³²P and ¹⁴C, or by other means, including chemical labeling, such 
as by nick-translation in the presence of deoxyuridylate biotinylated at the 
5'-position of the uracil moiety. The resulting probe includes the 
biotinylated uridylate in place of thymidylate residues and can be detected 
(via the biotin moieties) by any of a number of commercially available 
detection systems based on binding of streptavidin to the biotin. Such 
commercially available detection systems can be obtained, for example, 
from Enzo Biochemicals, Inc. (New York, NY). Any other label known to 
those of skill in the art, including non-radioactive labels, may be used as 
long as it renders the probes sufficiently detectable, which is a function of 
the sensitivity of the assay, the time available (for culturing cells, 
extracting DNA, and hybridization assays), the quantity of DNA or RNA 
available as a source of the probe, the particular label and the means used 
to detect the label. 

Once sequences with a sufficiently high degree of homology to the 
probe are identified, they can readily be isolated by standard techniques 
(see, e.g., Sambrook et al. (2001) Molecular Cloning: A Laboratory 
Manual, 3rd Edition, Cold Spring Harbor Laboratory Press). 

As used herein, DNA molecules that form stable hybrids under the 
specified conditions are considered substantially homologous, and a DNA 
or nucleic acid homolog refers to a nucleic acid that includes a preselected 
conserved nucleotide sequence, such as a sequence encoding a 
polypeptide. By the term "substantially homologous" is meant having at 
least 75%, preferably at least 80%, more preferably at least 90%, most 
preferably at least 95% homology therewith, or a lesser percentage of 
homology or identity and conserved biological activity or function. 

The terms "homology" and "identity" are often used 
interchangeably. In this regard, percent homology or identity may be 
determined, for example, by comparing sequence information using a GAP 
computer program. The GAP program utilizes the alignment method of 
Needleman and Wunsch (J. Mol. Biol. 48:443 (1970)), as revised by Smith 
and Waterman (Adv. Appl. Math. 2:482 (1981)). Briefly, the GAP 
program defines similarity as the number of aligned symbols (i.e., 
nucleotides or amino acids) which are similar, divided by the total number 
of symbols in the shorter of the two sequences. The preferred default 
parameters for the GAP program may include: (1) a unary comparison 
matrix (containing a value of 1 for identities and 0 for non-identities) and 
the weighted comparison matrix of Gribskov and Burgess, Nucl. Acids 
Res. 14:6745 (1986), as described by Schwartz and Dayhoff, eds., 
ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, National Biomedical 
Research Foundation, pp. 353-358 (1979); (2) a penalty of 3.0 for each 
gap and an additional 0.10 penalty for each symbol in each gap; and (3) 
no penalty for end gaps. 
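
As a minimal sketch of the similarity measure and gap penalty just 
described, and not an implementation of the GAP program itself, the 
following Python assumes that the two sequences have already been aligned 
(for example by a Needleman and Wunsch type method), that "-" marks a gap 
position, and that the unary comparison matrix is used so that "similar" 
means identical. The function names are hypothetical. 

```python
def gap_similarity(aligned_a, aligned_b, gap_char="-"):
    """Aligned identical symbols divided by the length of the shorter sequence."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Unary matrix: score 1 for an identity, 0 otherwise; gap columns never score.
    similar = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != gap_char
    )
    # Sequence lengths are counted without the gap symbols added by alignment.
    shorter = min(
        len(aligned_a.replace(gap_char, "")),
        len(aligned_b.replace(gap_char, "")),
    )
    if shorter == 0:
        raise ValueError("sequences must contain at least one symbol")
    return similar / shorter


def gap_penalty(internal_gap_lengths, per_gap=3.0, per_symbol=0.10):
    """Penalty of 3.0 per gap plus 0.10 per symbol in each gap; end gaps are not listed."""
    return sum(per_gap + per_symbol * length for length in internal_gap_lengths)
```

Because the denominator is the shorter sequence, a short sequence that 
aligns perfectly within a longer one scores 1.0 under this measure. 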

By sequence identity, the number of conserved amino acids is 
determined by standard alignment algorithm programs, which are used with 
the default gap penalties established by each supplier. Substantially 
homologous nucleic acid molecules would hybridize typically at moderate 
stringency or at high stringency all along the length of the nucleic acid of 
interest. Preferably the two molecules will hybridize under conditions of 
high stringency. Also contemplated are nucleic acid molecules that 
contain degenerate codons in place of codons in the hybridizing nucleic 
acid molecule. 

Whether any two nucleic acid molecules have nucleotide sequences 
that are at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% 
"identical" can be determined using known computer algorithms such as 
the "FASTA" program, using for example, the default parameters as in 
Pearson and Lipman, Proc. Natl. Acad. Sci. USA 85:2444 (1988). 
Alternatively the BLAST function of the National Center for Biotechnology 
Information database may be used to determine relative sequence 
identity. 

In general, sequences are aligned so that the highest order match 
is obtained. "Identity" per se has an art-recognized meaning and can be 
calculated using published techniques. (See, e.g.: Computational 
Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York, 
1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., 
Academic Press, New York, 1993; Computer Analysis of Sequence Data, 
Part I, Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 
1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic 
Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, 
J., eds., M Stockton Press, New York, 1991). While there exist a number 
of methods to measure identity between two polynucleotide or 
polypeptide sequences, the term "identity" is well known to skilled 
artisans (Carillo, H. & Lipton, D., SIAM J Applied Math 48:1073 (1988)). 
Methods commonly employed to determine identity or similarity between 
two sequences include, but are not limited to, those disclosed in Guide to 
Huge Computers, Martin J. Bishop, ed., Academic Press, San Diego, 
1994, and Carillo, H. & Lipton, D., SIAM J Applied Math 48:1073 
(1988). Methods to determine identity and similarity are codified in 
computer programs. Preferred computer program methods to determine 
identity and similarity between two sequences include, but are not limited 
to, GCG program package (Devereux, J., et al., Nucleic Acids Research 
12(1):387 (1984)), BLASTP, BLASTN, FASTA (Altschul, S.F., et al., J 
Molec Biol 215:403 (1990)). 

Therefore, as used herein, the term "identity" represents a 
comparison between a test and a reference polypeptide or polynucleotide. 
For example, a test polypeptide may be defined as any polypeptide that is 
90% or more identical to a reference polypeptide. 

As used herein, the term "at least 90% identical to" refers to 
percent identities from 90 to 99.99 relative to the reference polypeptides. 
Identity at a level of 90% or more is indicative of the fact that, assuming 
for exemplification purposes that a test and a reference polypeptide with a 
length of 100 amino acids are compared, no more than 10% (i.e., 10 out 
of 100) of the amino acids in the test polypeptide differ from those of the 
reference polypeptide. Similar comparisons may be made between test and 
reference polynucleotides. Such differences may be represented as point 
mutations randomly distributed over the entire length of an amino acid 
sequence or they may be clustered in one or more locations of varying 
length up to the maximum allowable, e.g., 10/100 amino acid difference 
(approximately 90% identity). Differences are defined as nucleic acid or 
amino acid substitutions, or deletions. 
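
The arithmetic of this definition can be illustrated with the following 
Python sketch, which is offered under stated assumptions only: the test 
and reference sequences are already aligned to equal length, "-" in the 
test sequence denotes a deletion, and the function names are 
hypothetical. 

```python
def max_allowed_differences(length, min_percent_identity):
    """Largest number of differing positions that still meets the identity level."""
    # Integer arithmetic: 100 positions at 90% identity allow exactly 10 differences.
    return length * (100 - min_percent_identity) // 100


def count_differences(test, reference):
    """Count substitutions and deletions between two aligned, equal-length sequences."""
    if len(test) != len(reference):
        raise ValueError("aligned sequences must have equal length")
    return sum(1 for t, r in zip(test, reference) if t != r)


def is_at_least_percent_identical(test, reference, min_percent_identity=90):
    """True if the test sequence meets the stated identity level."""
    allowed = max_allowed_differences(len(reference), min_percent_identity)
    return count_differences(test, reference) <= allowed
```

In this sketch ten differences clustered at one end of a 100-residue 
sequence and ten differences spread evenly along it give the same result, 
consistent with the statement that the differences may be distributed or 
clustered. 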

As used herein, stringency of hybridization in determining 
percentage mismatch encompasses the following conditions or equivalent 
conditions thereto: 

1) high stringency: 0.1 x SSPE or SSC, 0.1% SDS, 65°C 

2) medium stringency: 0.2 x SSPE or SSC, 0.1% SDS, 50°C 

3) low stringency: 1.0 x SSPE or SSC, 0.1% SDS, 50°C 

or any combination of salt and temperature and other reagents that result 
in selection of the same degree of mismatch or matching. Equivalent 
conditions refer to conditions that select for substantially the same 
percentage of mismatch in the resulting hybrids. Additions of ingredients, 
such as formamide, Ficoll, and Denhardt's solution affect parameters such 
as the temperature under which the hybridization should be conducted 
and the rate of the reaction. Thus, hybridization in 5 X SSC, in 20% 
formamide at 42°C is substantially the same as the conditions recited 
above for hybridization under conditions of low stringency. The recipes for 
SSPE, SSC and Denhardt's and the preparation of deionized formamide 
are described, for example, in Sambrook et al. (1989) Molecular Cloning, 
A Laboratory Manual, Cold Spring Harbor Laboratory Press, Chapter 8; 
see, Sambrook et al., vol. 3, p. B.13; see, also, numerous catalogs that 
describe commonly used laboratory solutions. It is understood that 
equivalent stringencies may be achieved using alternative buffers, salts 
and temperatures. As used herein, all assays and procedures, such as 
hybridization reactions and antibody-antigen reactions, unless otherwise 
specified, are conducted under conditions recognized by those of skill in 
the art as standard conditions. 
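
For convenience, the three recited conditions can be held in a small 
lookup structure, as in the following Python sketch. The type and 
function names are hypothetical. The comparison function encodes only the 
general rule stated above, that lower temperature and higher salt permit 
more mismatch; it does not model formamide or other additives, and it 
leaves pairs of conditions that differ in opposite directions unranked. 

```python
from typing import NamedTuple


class HybridizationCondition(NamedTuple):
    salt_x: float         # SSPE or SSC concentration, in multiples of 1 x
    sds_percent: float    # SDS concentration, percent
    temperature_c: float  # temperature, degrees Celsius


# Values copied from the three conditions recited above.
STRINGENCY = {
    "high": HybridizationCondition(0.1, 0.1, 65.0),
    "medium": HybridizationCondition(0.2, 0.1, 50.0),
    "low": HybridizationCondition(1.0, 0.1, 50.0),
}


def is_at_least_as_stringent(first, second):
    """True if `first` has no more salt and no lower temperature than `second`."""
    return (
        first.salt_x <= second.salt_x
        and first.temperature_c >= second.temperature_c
    )
```

With these values, the high stringency condition ranks at least as 
stringent as the medium condition, and the medium condition at least as 
stringent as the low condition. 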

As used herein, conservative amino acid substitutions, such as 
those set forth in Table 1, are those that do not eliminate biological 
activity. Suitable conservative substitutions of amino acids are known to 
those of skill in this art and may be made generally without altering the 
biological activity of the resulting molecule. Those of skill in this art 
recognize that, in general, single amino acid substitutions in non-essential 
regions of a polypeptide do not substantially alter biological activity (see, 
e.g., Watson et al. Molecular Biology of the Gene, 4th Edition, 1987, The 
Benjamin/Cummings Pub. Co., p. 224). Conservative amino acid 
substitutions are made, for example, in accordance with those set forth in 
TABLE 1 as follows: 

TABLE 1 

Original residue     Conservative substitution 
Ala (A)              Gly; Ser; Abu 
Arg (R)              Lys; Orn 
Asn (N)              Gln; His 
Cys (C)              Ser 
Gln (Q)              Asn 
Glu (E)              Asp 
Gly (G)              Ala; Pro 
His (H)              Asn; Gln 
Ile (I)              Leu; Val; Met; Nle; Nva 
Leu (L)              Ile; Val; Met; Nle; Nva 
Lys (K)              Arg; Gln; Glu 
Met (M)              Leu; Tyr; Ile; Nle; Val 
Ornithine            Lys; Arg 
Phe (F)              Met; Leu; Tyr 
Ser (S)              Thr 
Thr (T)              Ser 
Trp (W)              Tyr 
Tyr (Y)              Trp; Phe 
Val (V)              Ile; Leu; Met; Nle; Nva 

Other substitutions are also permissible and may be determined 
empirically or in accord with known conservative substitutions. 

As used herein, the amino acids, which occur in the various amino 
acid sequences appearing herein, are identified according to their well- 
known, three-letter or one-letter abbreviations. The nucleotides, which 
occur in the various DNA fragments, are designated with the standard 
single-letter designations used routinely in the art. 

As used herein, a splice variant refers to a variant produced by 
differential processing of a primary transcript of genomic DNA that results 
in more than one type of mRNA. 

As used herein, a probe or primer based on a nucleotide sequence 
includes at least 10, 14, 16, 30 or 100 contiguous nucleotides from the 
reference nucleic acid molecule. 

As used herein, recombinant production by using recombinant DNA 
methods refers to the use of the well known methods of molecular 
biology for expressing proteins encoded by cloned DNA. 

As used herein, biological activity refers to the in vivo activities of 
a compound or physiological responses that result upon in vivo 
administration of a compound, composition or other mixture. Biological 
activity, thus, encompasses therapeutic effects and pharmaceutical 
activity of such compounds, compositions and mixtures. Biological 
activities may be observed in in vitro systems designed to test or use 
such activities. Thus, for purposes herein the biological activity of a 
luciferase is its oxygenase activity whereby, upon oxidation of a 
substrate, light is produced. 

The meaning of the terms substantially identical or similar varies with 
the context as understood by those skilled in the relevant art and 
generally means at least 40, 60, 80, 90, 95 or 98%. 

As used herein, substantially identical to a product means 
sufficiently similar so that the property is sufficiently unchanged so that 
the substantially identical product can be used in place of the product. 

As used herein, substantially pure means sufficiently homogeneous 
to appear free of readily detectable impurities as determined by standard 
methods of analysis, such as thin layer chromatography (TLC), gel 
electrophoresis and high performance liquid chromatography (HPLC), used 
by those of skill in the art to assess such purity, or sufficiently pure such 
that further purification would not detectably alter the physical and 
chemical properties, such as enzymatic and biological activities, of the 
substance. Methods for purification of the compounds to produce 
substantially chemically pure compounds are known to those of skill in 
the art. A substantially chemically pure compound may, however, be a 
mixture of stereoisomers or isomers. In such instances, further 
purification might increase the specific activity of the compound. 

As used herein, vector (or plasmid) refers to discrete elements that 
are used to introduce heterologous DNA into cells for either expression or 
replication thereof. The vectors typically remain episomal, but may be 
designed to effect integration of a gene or portion thereof into a 
chromosome of the genome. Also contemplated are vectors that are 
artificial chromosomes, such as yeast artificial chromosomes and 
mammalian artificial chromosomes. Selection and use of such vehicles 
are well known to those of skill in the art. An expression vector includes 
vectors capable of expressing DNA that is operatively linked with 
regulatory sequences, such as promoter regions, that are capable of 
effecting expression of such DNA fragments. Thus, an expression vector 
refers to a recombinant DNA or RNA construct, such as a plasmid, a 
phage, recombinant virus or other vector that, upon introduction into an 
appropriate host cell, results in expression of the cloned DNA. 
Appropriate expression vectors are well known to those of skill in the art 
and include those that are replicable in eukaryotic cells and/or prokaryotic 
cells and those that remain episomal or those which integrate into the 
host cell genome. 

As used herein, protein-binding-sequence refers to a protein or 
peptide sequence that is capable of specific binding to other protein or 
peptide sequences generally, to a set of protein or peptide sequences or 
to a particular protein or peptide sequence. 

As used herein, a composition refers to any mixture of two or more 
ingredients. It may be a solution, a suspension, liquid, powder, a paste, 
aqueous, non-aqueous or any combination thereof. 

As used herein, a combination refers to any association between 
two or more items. 

As used herein, fluid refers to any composition that can flow. 
Fluids thus encompass compositions that are in the form of semi-solids, 
pastes, solutions, aqueous mixtures, gels, lotions, creams and other such 
compositions. 

As used herein, a cellular extract refers to a preparation or fraction 
that is made from a lysed or disrupted cell. 

As used herein, the term "subject" refers to animals, plants, 
insects, and birds and other phyla, genera and species into which nucleic 
acid molecules may be introduced. Included are higher organisms, such 
as mammals, fish, insects and birds, including humans, primates, cattle, 
pigs, rabbits, goats, sheep, mice, rats, guinea pigs, hamsters, cats, dogs, 
horses, chicken and others. 

As used herein, flow cytometry refers to processes that use a 
laser-based instrument capable of analyzing and sorting out cells and/or 
chromosomes based on size and fluorescence. 

As used herein, the abbreviations for any protective groups, amino 
acids and other compounds, are, unless indicated otherwise, in accord 
with their common usage, recognized abbreviations, or the IUPAC-IUB 
Commission on Biochemical Nomenclature (see, (1972) Biochem. 
11:942-944). 

B. Recombination systems 

Site-specific recombination systems typically contain three 
elements: a pair of DNA sequences (the site-specific recombination 
sequences) and a specific enzyme (the site-specific recombinase). The 
site-specific recombinase catalyzes a recombination reaction between two 
site-specific recombination sequences. 

A number of different site-specific recombinase systems are 
available and/or known to those of skill in the art, including, but not 
limited to: the Cre/lox recombination system using CRE recombinase (see, 
e.g., SEQ ID Nos. 58 and 59) from the Escherichia coli phage P1 (see, 
e.g., Sauer (1993) Methods in Enzymology 225:890-900; Sauer et al. 
(1990) The New Biologist 2:441-449; Sauer (1994) Current Opinion in 
Biotechnology 5:521-527; Odell et al. (1990) Mol. Gen. Genet. 
223:369-378; Lasko et al. (1992) Proc. Natl. Acad. Sci. U.S.A. 
89:6232-6236; U.S. Patent No. 5,658,772), the FLP/FRT system of yeast 
using the FLP recombinase (see, SEQ ID Nos. 60 and 61) from the 2μ 
episome of Saccharomyces cerevisiae (Cox (1983) Proc. Natl. Acad. Sci. 
U.S.A. 80:4223; Falco et al. (1982) Cell 29:573-584; Golic et al. (1989) 
Cell 59:499-509; U.S. Patent No. 5,744,336), the resolvases, including 
Gin recombinase of phage Mu (Maeser et al. (1991) Mol. Gen. Genet. 
230:170-176; Klippel, A. et al. (1993) EMBO J. 12:1047-1057; see, e.g., 
SEQ ID Nos. 64-67), Cin, Hin, αδ Tn3; the Pin recombinase of E. coli 
(see, e.g., SEQ ID Nos. 68 and 69; Enomoto et al. (1983) J. Bacteriol. 
6:663-668), the R/RS system of the pSR1 plasmid of Zygosaccharomyces 
rouxii (Araki et al. (1992) J. Mol. Biol. 225:25-37; Matsuzaki et al. (1990) 
J. Bacteriol. 172:610-618) and site-specific recombinases from 
Kluyveromyces drosophilarium (Chen et al. (1986) Nucleic Acids Res. 
14:4471-4481) and Kluyveromyces waltii (Chen et al. (1992) J. Gen. 
Microbiol. 138:337-345). Other systems are known to those of skill in 
the art (Stark et al. Trends Genet. 8:432-439; Utatsu et al. (1987) J. 
Bacteriol. 169:5537-5545; see, also, U.S. Patent No. 6,171,861). 

Members of the highly related family of site-specific recombinases, 
the resolvase family, such as γδ, Tn3 resolvase, Hin, Gin, and Cin are also 
available. Members of this family of recombinases are typically 
constrained to intramolecular reactions (e.g., inversions and excisions) 
and can require host-encoded factors. Mutants have been isolated that 
relieve some of the requirements for host factors (Maeser et al. (1991) 
Mol. Gen. Genet. 230:170-176), as well as some of the constraints 
of intramolecular recombination (see, U.S. Patent No. 6,171,861). 

The bacteriophage P1 Cre/lox and the yeast FLP/FRT systems are 
particularly useful systems for site-specific integration, inversion or 
excision of heterologous nucleic acid into, and out of, chromosomes, 
particularly ACes as provided herein. In these systems a recombinase 
(Cre or FLP) interacts specifically with its respective site-specific 
recombination sequence (lox or FRT, respectively) to invert or excise the 
intervening sequences. The sequence for each of these two systems is 
relatively short (34 bp for lox and 47 bp for FRT). 

The FLP/FRT recombinase system has been demonstrated to 
function efficiently in plant cells (U.S. Patent No. 5,744,386), and, thus, 
can be used for producing plant artificial chromosome platforms. In 
general, short incomplete FRT sites lead to higher accumulation of 
excision products than the complete full-length FRT sites. The system 
catalyzes intra- and intermolecular reactions, and, thus, can be used for 
DNA excision and integration reactions. The recombination reaction is 
reversible and this reversibility can compromise the efficiency of the 
reaction in each direction. Altering the structure of the site-specific 
recombination sequences is one approach to remedying this situation. 
The site-specific recombination sequence can be mutated in such a manner 
that the product of the recombination reaction is no longer recognized as 
a substrate for the reverse reaction, thereby stabilizing the integration or 
excision event. 

In the Cre-lox system, discovered in bacteriophage P1, 
recombination between loxP sites occurs in the presence of the Cre 
recombinase (see, e.g., U.S. Patent No. 5,658,772). This system can be 
used to insert, invert or excise nucleic acid located between two lox sites. 
Cre can be expressed from a vector. Since the lox site is an asymmetrical 
nucleotide sequence, lox sites on the same DNA molecule can have the 
same or opposite orientation with respect to each other. Recombination 
between lox sites in the same orientation results in a deletion of the DNA 
segment located between the two lox sites and a connection between the 
resulting ends of the original DNA molecule. The deleted DNA segment 
forms a circular molecule of DNA. The original DNA molecule and the 
resulting circular molecule each contain a single lox site. Recombination 
between lox sites in opposite orientations on the same DNA molecule 
results in an inversion of the nucleotide sequence of the DNA segment 
located between the two lox sites. In addition, reciprocal exchange of 
DNA segments proximate to lox sites located on two different DNA 
molecules can occur. All of these recombination events are catalyzed by 
the product of the Cre coding region. 
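
The two intramolecular outcomes described above can be pictured with the 
following toy model in Python. It is a bookkeeping sketch only, under 
assumed conventions that are not part of this disclosure: a DNA molecule 
is a list of segment labels, ">" and "<" stand for lox sites in the two 
orientations, exactly two lox sites are present, and the function name is 
hypothetical. Intermolecular exchange is not modeled. 

```python
LOX_FORWARD = ">"
LOX_REVERSE = "<"


def recombine(molecule):
    """Return the products of recombination between two lox sites on one molecule."""
    sites = [i for i, item in enumerate(molecule)
             if item in (LOX_FORWARD, LOX_REVERSE)]
    if len(sites) != 2:
        raise ValueError("this sketch expects exactly two lox sites")
    first, second = sites
    between = molecule[first + 1:second]
    if molecule[first] == molecule[second]:
        # Same orientation: the intervening segment is deleted as a circle;
        # the original molecule and the circle each keep a single lox site.
        linear = molecule[:first + 1] + molecule[second + 1:]
        circle = between + [molecule[second]]
        return {"event": "excision", "linear": linear, "circle": circle}
    # Opposite orientation: the intervening segment is inverted in place.
    linear = molecule[:first + 1] + between[::-1] + molecule[second:]
    return {"event": "inversion", "linear": linear, "circle": None}
```

For example, `recombine(["a", ">", "b", "c", ">", "d"])` reports an 
excision that leaves `["a", ">", "d"]` and a circle `["b", "c", ">"]`, 
while replacing the second site with `"<"` reports an inversion of the 
`b` and `c` segments. 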

Any site-specific recombinase system known to those of skill in the 
art is contemplated for use herein. It is contemplated that one or a 
plurality of sites that direct the recombination by the recombinase are 
introduced into an artificial chromosome to produce platform ACes. The 
resulting platform ACes are introduced into cells with nucleic acid 
encoding the cognate recombinase, typically on a vector, and nucleic acid 
encoding heterologous nucleic acid of interest linked to the appropriate 
recombination site for insertion into the platform ACes. The recombinase- 
encoding nucleic acid may be introduced into the cells on the same 
vector as, or a different vector from, that encoding the heterologous 
nucleic acid. 

An E. coli phage lambda integrase system for ACes platform 
engineering and for artificial chromosome engineering is provided (Lorbach 
et al. (2000) J. Mol. Biol. 296:1175-1181). The phage lambda integrase 
(Landy, A. (1989) Annu. Rev. Biochem. 58:913-949) is adapted herein and 
the cognate att sites are provided. Chromosomes, including ACes, 
engineered to contain one or a plurality of att sites are provided, as are 
vectors encoding a mutant integrase that functions in the absence of other 
factors. Methods using the modified chromosomes and vectors for 
introduction of heterologous nucleic acid are also provided. 

For purposes herein, one or more of the sites (e.g., a single site or 
a pair of sites) required for recombination are introduced into an artificial 
chromosome, such as an ACes chromosome. The enzyme for catalyzing 
site-directed recombination is introduced with the DNA of interest, or 
separately, or is engineered onto the artificial chromosome under the 
control of a regulatable promoter. 

As described herein, artificial chromosome platforms containing one 
or multiple recombination sites are provided. The methods and resulting 
products are exemplified with the lambda phage Att/Int system, but 
similar methods may be used for production of ACes platforms with other 
recombination systems. 

The Att/Int system and vectors provided herein are not only 
intended for engineering ACes platforms, but may be used to engineer an 
Att/Int system into any chromosome. Introduction of att sites into a 
chromosome will permit engineering of natural chromosomes, such as by 
permitting targeted integration of genes or regulatory regions, and by 
controlled excision of selected regions. For example, genes encoding a 
particular trait may be added to a chromosome, such as a plant 
chromosome engineered to contain one or a plurality of att sites. Such 
chromosomes may be used for screening DNA to identify genes. Large 
pieces of DNA can be introduced into cells and the cells screened 
phenotypically to select those having the desired trait. 

C. Platforms 

Provided herein are platform artificial chromosomes (platform ACes) 
containing single or multiple site-specific recombination sites. 
Chromosome-based platform technology permits efficient and tractable 
engineering and subsequent expression of multiple gene targets. Methods 
are provided that use DNA vectors and fragments to create platform 
artificial chromosomes, including animal, particularly mammalian, artificial 
chromosomes, and plant artificial chromosomes. The artificial 
chromosomes contain either single or multiple sequence-specific 
recombination sites suitable for the placement of target gene expression 
vectors onto the platform chromosome. The engineered chromosome- 
based platform ACes technology is applicable for methods, including 
cellular and transgenic protein production, transgenic plant and animal 
production and gene therapy. The platform ACes are also useful for 
producing a library of ACes comprising random portions of a given 
genome (e.g., a mammalian, plant or prokaryotic genome) for genomic 
screening; as well as a library of cells comprising different and/or mutually 
exclusive ACes therein. 

Exemplary of artificial chromosome platforms are those based on 
ACes. ACes artificial chromosomes are non-viral, self-replicating nucleic 
acid molecules that function as a natural chromosome, having all the 
elements required for normal chromosomal replication and maintenance 
within the cell nucleus. ACes artificial chromosomes do not rely on 
integration into the genome of the cell to be effective, and they are not 
limited by DNA carrying capacity and as such the therapeutic gene(s) of 
interest, including regulatory sequences, can be engineered into the ACes. 
In addition, ACes are stable in vitro and in vivo and can provide 
predictable long-term gene expression. Once engineered and delivered to 
the appropriate cell or embryo, ACes work independently alongside host 
chromosomes and, for ACes that are predominantly heterochromatin, 
produce only the products (proteins) of the genes they carry. As 
provided herein ACes are modified by introduction of recombination site(s) 
to provide a platform for ready introduction of heterologous nucleic acid. 
The ACes platforms can be used for production of transgenic animals and 
plants; as vectors for genetic therapy; for use as protein production 
systems; for animal models to identify and target new therapeutics; in cell 
culture for the development and production of therapeutic proteins; and 
for a variety of other applications. 

1. Generation of artificial chromosomes 

Artificial chromosomes may be generated by any method known to 
those of skill in the art. Of particular interest herein are the ACes artificial 
chromosomes, which contain a repeated unit. Methods for production of 
ACes are described in detail in U.S. Patent Nos. 6,025,155 and 
6,077,697, which, as with all patents, applications, publications and 
other disclosure, are incorporated herein in their entirety. 

Generation of de novo ACes 

ACes can be generated by cotransfecting exogenous DNA, such 
as a mammary tissue specific DNA cassette including the gene sequences 
for a therapeutic protein, with a rDNA fragment and a drug resistance 
marker gene into the desired eukaryotic cell, such as plant or animal cells, 
such as murine cells in vitro. DNA with a selectable or detectable marker 
is introduced, and can be allowed to integrate randomly into pericentric 
heterochromatin or can be targeted to pericentric heterochromatin, such 
as that in rDNA gene arrays that reside on acrocentric chromosomes, 
such as the short arms of acrocentric chromosomes. This integration 
event activates the "megareplicator" sequence and amplifies the 
pericentric heterochromatin and the exogenous DNA, and duplicates a 
centromere. Ensuing breakage of this "dicentric" chromosome can result 
in the production of daughter cells that contain the substantially-original 
chromosome and the new artificial chromosome. The resulting ACes 
contain all the essential elements needed for stability and replication in 
dividing cells: centromere, origins of replication, and telomeres. ACes 
have been produced that express marker genes (lacZ, green fluorescent 
protein, neomycin-resistance, puromycin-resistance, hygromycin- 
resistance) and genes of interest. Isolated ACes, for example, have been 
successfully transferred intact to rodent, human, and bovine cells by 
electroporation, sonoporation, microinjection, and transfection with lipids 
and dendrimers. 

To render the creation of ACes with desired genes more tractable
and efficient, "platform" ACes (platform-ACes) can be produced that
contain defined DNA sequences for enzyme-mediated homologous DNA
recombination, such as by Cre or FLP recombinases (Bouhassira et al.
(1996) Blood 88(supplement 1):190a; Bouhassira et al. (1997) Blood
90:3332-3344; Siebler et al. (1997) Biochemistry 36:1740-1747;
Siebler et al. (1998) Biochemistry 37:6229-6234; and Bethke et al.
(1997) Nucl. Acids Res. 25:2828-2834), and as exemplified herein the
lambda phage integrase. A lox site contains two 13 bp inverted repeats
to which Cre-recombinase binds and an intervening 8 bp core region.
Only pairs of sites having identity in the central 6 bp of the core region
are proficient for recombination; sites having non-identical core sequences
(heterospecific lox sites) do not efficiently recombine with each other
(Hoess et al. (1986) Nucleic Acids Res. 14:2287-2300).
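
The compatibility rule for lox sites described above can be illustrated with a short Python sketch. It splits a 34 bp lox site into its two 13 bp arms and 8 bp core and compares the central 6 bp of two cores. The loxP sequence used is the commonly cited wild-type site; the variant site, the function names, and the choice of which base to change are assumptions made only for illustration and are not taken from this disclosure.

```python
# Sketch: decide whether two lox sites are expected to recombine efficiently,
# using the rule stated in the text (identity in the central 6 bp of the core).

ARM = 13   # length of each inverted repeat bound by Cre
CORE = 8   # length of the intervening core region

LOXP = "ATAACTTCGTATAGCATACATTATACGAAGTTAT"  # wild-type loxP, 34 bp


def reverse_complement(seq):
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq))


def split_lox(site):
    """Return (left arm, core, right arm) of a 34 bp lox site."""
    if len(site) != 2 * ARM + CORE:
        raise ValueError("expected a 34 bp lox site")
    return site[:ARM], site[ARM:ARM + CORE], site[ARM + CORE:]


def arms_are_inverted_repeats(site):
    """True if the right arm is the reverse complement of the left arm."""
    left, _, right = split_lox(site)
    return reverse_complement(left) == right


def proficient_pair(site_a, site_b):
    """True if the central 6 bp of the two 8 bp cores are identical."""
    core_a = split_lox(site_a)[1]
    core_b = split_lox(site_b)[1]
    return core_a[1:7] == core_b[1:7]


# Hypothetical heterospecific site: one base changed inside the central 6 bp.
_left, _core, _right = split_lox(LOXP)
LOX_VARIANT = _left + _core[:2] + "T" + _core[3:] + _right
```

Under this simplified rule, a site that differs from its partner only at the first or last position of the core would still be scored as proficient, because only the central 6 bp are compared.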

Generating acrocentric chromosomes for plant
artificial chromosome formation.

In human and mouse cells, de novo formation of a satellite DNA
based artificial chromosome (SATAC, also referred to as ACes) can occur
in an acrocentric chromosome where the short arm contains only
pericentric heterochromatin, the rDNA array, and telomere sequences.
Plant species may not have any acrocentric chromosomes with the same
physical structure described, but "megareplicator" DNA sequences reside
in the plant rDNA arrays, also known as the nucleolar organizing regions
(NOR). A structure like those seen in acrocentric mammalian
chromosomes can be generated using site-specific recombination between
appropriate arms of plant chromosomes.
Approach 

Qin et al. ((1994) Proc. Natl. Acad. Sci. U.S.A. 91:1706-1710)
describe crossing two Nicotiana tabacum transgenic plants. One
plant contains a construct encoding a promoterless hygromycin-resistance
gene preceded by a lox site (lox-hpt); the other plant carries a construct
containing a cauliflower mosaic virus 35S promoter linked to a lox
sequence and the cre DNA recombinase coding region (35S-lox-cre). The
constructs were introduced separately by infecting leaf explants with
Agrobacterium tumefaciens, which carries the kanamycin-resistance gene
(Kan^R). The resultant Kan^R transgenic plants were crossed. Plants that
carried the appropriate DNA recombination event were identified by
hygromycin-resistance.

Modification of the above for generation of ACes

The Kan^R cultivars are initially screened, such as by FISH, to
identify two sets of candidate transgenic plants. One set has one
construct integrated in regions adjacent to the pericentric heterochromatin
on the short arm of any chromosome. The second set of candidate plants
has the other construct integrated in the NOR region of appropriate
chromosomes. To obtain reciprocal translocation, both sites must be in
the same orientation. Therefore a series of crosses is required, Kan^R
plants are generated, and FISH analyses are performed to identify the
appropriate "acrocentric" plant chromosome for de novo plant ACes formation.

2. Bacteriophage lambda integrase-based site-specific
recombination system

An integral part of the platform technology includes a site-specific
recombination system that allows the placement of selected gene targets
or genomic fragments onto the platform chromosomes. Any such system
may be used. In particular, a method is provided for insertion of
additional DNA fragments into the platform chromosome residing in the
cell via sequence-specific recombination using the recombinase activity of
the bacteriophage lambda integrase. The lambda integrase system is
exemplary of the recombination systems contemplated for ACes. Any
known recombination system, including any described herein, particularly
any that operates without the need for additional factors or that, by virtue
of mutation, does not require additional factors, is contemplated.

As noted, the lambda integrase system provided herein can be used
with natural chromosomes and artificial chromosomes in addition to
ACes. A single recombination site or a plurality of recombination sites,
which may be the same or different, are introduced into artificial
chromosomes to produce artificial chromosome platforms.

3. Creation of bacteriophage lambda integrase site-specific 
recombination system 

The lambda phage-encoded integrase (designated Int) is a
prototypical member of the integrase family. Int effects integration and
excision of the phage in and out of the E. coli genome via recombination
between pairs of attachment sites designated attB/attP and attL/attR.
Each att site contains two inverted 9 base pair core Int binding sites and a
7 base pair overlap region that is identical in wild-type att sites. Each
site, except for attB, contains additional Int binding sites. In flanking
regions, there are recognition sequences for accessory DNA binding
proteins, such as integration host factor (IHF), factor for inversion
stimulation (FIS) and the phage encoded excision protein (XIS). Except
for attB, Int is a heterobivalent DNA-binding protein and, with assistance
from the accessory proteins and negative DNA supercoiling, binds
simultaneously to core and arm sites within the same att site.

Int, like Cre and FLP, executes an ordered sequential pair of strand
exchanges during integrative and excisive recombination. The natural
pairs of target sequences for Int, attB and attP or attL and attR, are
located on the same or different DNA molecules, resulting in intra- or
intermolecular recombination, respectively. For example, intramolecular
recombination occurs between inversely oriented attB and attP, or
between attL and attR sequences, respectively, leading to inversion of the
intervening DNA segment.

Like the recombinase systems, such as Cre and FLP, Int directs
site-specific recombination. Unlike the other systems, such as Cre and FLP,
Int generally requires additional protein factors for integrative and excisive
recombination and negative supercoiling for integrative recombination.
Hence, the Int system had not been used in eukaryotic targeting systems.
Mutant Int proteins, designated Int-h (E174K) and a derivative
thereof Int-h/218 (E174K/E218K), do not require accessory proteins to
perform intramolecular integrative and excisive recombination in
co-transfection assays in human cells (Lorbach et al. (2000) J. Mol. Biol.
296:1175-1181); wild-type Int does not catalyze intramolecular
recombination in human cells harboring target sites attB and attP.
Hence it had been demonstrated that mutant Int can catalyze
factor-independent recombination events in human cells.

There has been no demonstration by others that this system can be
used for engineering of eukaryotic genomes or chromosomes. Provided
herein are chromosomes, including artificial chromosomes, such as but
not limited to ACes that contain att sites (e.g., platform ACes), and the
use of such chromosomes for targeted integration of heterologous DNA
into such chromosomes in eukaryotic cells, including animal, such as
rodent and human, and plant cells. Mutant Int provided herein is shown
to effect site-directed recombination between sites in artificial
chromosomes and vectors containing cognate sites.

An additional component of the chromosome-based platform
technology is the site-specific integration of target DNA sequences onto
the platform. For this, the native bacteriophage lambda integrase has
been modified to carry out this sequence-specific DNA recombination
event in eukaryotic cells. The bacteriophage lambda integrase, with its
cognate DNA substrate att, is a member of the site-specific recombinase
family that also includes the bacteriophage P1 Cre/lox system as well as
the Saccharomyces cerevisiae 2 micron based FLP/FRT system (see, e.g.,
Landy (1989) Ann. Rev. Biochem. 58:913-949; Hoess et al. (1982) Proc.
Natl. Acad. Sci. U.S.A. 79:3398-3402; Broach et al. (1982) Cell
29:227-234).

By combining DNA endonuclease and DNA ligase activity, these
recombinases recognize and catalyze DNA exchanges between sequences
flanking the recognition site. During the integration of the lambda genome
into the E. coli genome (lambda recombination), the phage integrase (INT)
in association with accessory proteins catalyzes the DNA exchange
between the attP site of the phage genome and the attB site of the
bacterial genome, resulting in the formation of attL and attR sites (Figure
6). The engineered bacteriophage lambda integrase has been produced
herein to carry out an intermolecular DNA recombination event between
an incoming DNA molecule (primarily on a vector containing the bacterial
attB site) and the chromosome-based platform carrying the lambda attP
sequence, independent of lambda bacteriophage or bacterial accessory
proteins.

In contrast to the bi-directional Cre/lox and FLP/FRT systems, the
engineered lambda recombination system derived for chromosome-based
platform technology is advantageously unidirectional because accessory
proteins, which are absent, are required for excision of integrated nucleic
acid upon further exposure to the lambda Int recombinase.

4. Creation of platform chromosome containing single or
multiple sequence-specific recombination sites

a. Multiple sites

For the creation of a platform chromosome containing multiple,
sequence-specific recombination sites, artificial chromosomes are
produced as depicted in Figure 5 and Example 3. As discussed above,
artificial chromosomes can be produced using any suitable methodology,
including those described in U.S. Patent Nos. 5,288,625; 5,712,134;
5,891,691; 6,025,155. Briefly, to prepare artificial chromosomes
containing multiple recombination (e.g., integration) sites, nucleic acid
(in the form of one or more plasmids, such as the plasmid
pSV40193attPsensePUR set forth in Example 3) is targeted into an
amplifiable region of a chromosome, such as the pericentric region of a
chromosome. Among such regions are the rDNA gene loci in acrocentric
mammalian chromosomes. Hence, targeting nucleic acid for integration
into the rDNA region of mammalian acrocentric chromosomes can include
the mouse rDNA fragments (for targeting into rodent cell lines) or large
human rDNA regions on BAC/PAC vectors (or subclones thereof in
standard vectors) for targeting into human acrocentric chromosomes,
such as for human gene therapy applications. The targeting nucleic acid
generally includes a detectable or selectable marker, such as antibiotic
resistance, such as puromycin and hygromycin, a recombination site
(such as attP, attB, attL, attR or the like), and/or human selectable
markers as required for gene therapy applications. Cells are grown under
conditions that result in amplification and ultimately production of ACes
artificial chromosomes having multiple recombination (e.g., integration)
sites therein. ACes having the desired size are selected for further
engineering.

b. Creation of platform chromosome containing a 
single sequence-specific recombination site 

In this method a mammalian platform artificial chromosome is
generated containing a single sequence-specific recombination site. In
the Example below, this approach is demonstrated using a puromycin
resistance marker for selection and a mouse rDNA fragment for targeting
into the rDNA locus on mouse acrocentric chromosomes. Other selection
markers and targeting DNA sequences as desired and known to those of
skill in the art can be used. Additional resistance markers include genes
conferring resistance to the antibiotics neomycin, blasticidin, hygromycin
and zeocin. For applications, such as gene therapy, in which potentially
immunogenic responses are to be avoided, host-derived, such as
human-derived, selectable markers or markers detectable with monoclonal
antibodies (MAb) followed by fluorescent activated cell sorting (FACS) can
be used. Examples in this class include, but are not limited to: human
nerve growth factor receptor (detection with MAb); truncated human growth
factor receptor (detection with MAb); mutant human dihydrofolate reductase
(DHFR; detectable using a fluorescent methotrexate substrate); secreted
alkaline phosphatase (SEAP; detectable with fluorescent substrate);
thymidylate synthase (TS; confers resistance to fluorodeoxyuridine);
human CAD gene (confers resistance to N-phosphonacetyl-L-aspartate
(PALA)).

To construct a platform artificial chromosome with a single site, an
ACes artificial chromosome (or other artificial chromosome of interest)
can be produced containing a selectable marker. A single
sequence-specific recombination site is targeted onto ACes via homologous
recombination. For this, DNA sequences containing the site-specific
recombination sequence are flanked with DNA sequences homologous to
a selected sequence in the chromosome. For example, when using a
chromosome containing rDNA or satellite DNA, such DNA can be used as
homologous sequences to target the site-specific recombination sequence
onto the chromosome. A vector is designed to have these homologous
sequences flanking the site-specific recombination site and, after the
appropriate restriction enzyme digest to generate free ends of homology
to the chromosome, the DNA is transfected into cells harboring the
chromosome. After transfection and integration of the site-specific
cassette, homologous recombination events onto the platform
chromosome are subcloned and identified, for example by screening
single cell subclones via expression of resistance or a fluorescent marker
and PCR analysis. In one embodiment, a platform artificial chromosome,
such as a platform ACes, that contains a single copy of the recombination
site is selected. Examples 2B and 2D exemplify the process, and Figure 3
provides a diagram depicting one method for the creation of a platform
mammalian chromosome containing a single sequence-specific
recombination site.

5. Lambda integrase mediated recombination of target gene
expression vector onto platform chromosome

The third component of the chromosome-based platform
technology involves the use of target gene expression vectors carrying,
for example, genes for gene therapy, genes for transgenic animal or plant
production, and those required for cellular protein production of interest.
Using lambda integrase mediated site-specific recombination, or any other
recombinase-mediated site-specific recombination, the target gene
expression vectors are introduced onto the selected chromosome
platform. The use of target gene expression vectors permits use of the de
novo generated chromosome-based platforms for a wide range of gene
targets. Furthermore, chromosome platforms containing multiple attP
sites provide the opportunity to incorporate multiple gene targets onto a
single platform, thereby providing for expression of multiple gene targets,
including the expression of cellular and genetic regulatory genes and the
expression of all or parts of metabolic pathways. In addition to
expressing small target genes, such as cDNA and hybrid cDNA/artificial
intron constructs, the chromosome-based platform can be used for
engineering and expressing large genomic fragments carrying target genes
along with their endogenous genomic promoter sequences. This is of
importance, for example, where the therapy requires precise cell-specific
expression and in instances where expression is best achieved from
genomic clones rather than cDNA clones. Figure 9 provides a diagram
summarizing one embodiment of the chromosome-based technology.

A feature of the target gene expression vector that is of interest to
include is a promoterless marker gene, which as exemplified (see Figure
9) contains an upstream attB site (marker 2 on Figure 9). The nucleic
acid encoding the marker is not expressed unless it is placed downstream
from a promoter sequence. Using the recombinase technology provided
herein, such as the lambda integrase technology (λINTE174R on Figure 8)
provided herein, site-specific recombination between the attB site on the
vector and the promoter-attP site (in the "sense" orientation) on the
chromosome-based platform results in the expression of marker 2 on the
target gene expression vector, thereby providing a positive selection for
the lambda INT mediated site-specific recombination event. Site-specific
recombination events on the chromosome-based platform versus random
integrations next to a promoter in the genome (false positive) can be
quickly screened by designing primers to detect the correct event by PCR.
Examples of suitable marker 2 genes include, but are not limited to,
genes that confer resistance to toxic compounds or antibiotics,
fluorescence activated cell sorting (FACS) sortable cell surface markers
and various fluorescent markers. Examples of these genes include, but
are not limited to, human L26a^R (human homolog of Saccharomyces
cerevisiae CYH° gene), neomycin, puromycin, blasticidin, CD24 (see, e.g.,
US Patents 5,804,177 and 6,074,836), truncated CD4, truncated low
affinity nerve growth factor receptor (LNGFR), truncated LDL receptor,
truncated human growth hormone receptor, GFP, RFP, BFP.
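
The positive selection described above can be pictured with a toy Python model. This is a sketch under stated assumptions, not the procedure of the Examples: constructs are ordered lists of named elements, the platform is assumed to carry a first marker downstream of its attP site, the two junction sites formed by recombination are given neutral placeholder names, and the target cassette is treated as a single element without its own promoter. All function and element names are hypothetical.

```python
# Toy model of positive selection by promoter trapping. Constructs are
# ordered lists of named elements read 5' to 3'.

ATT_SITES = {"attP", "attB", "att_hybrid_left", "att_hybrid_right"}


def integrate(platform, vector):
    """Insert a circular vector (opened at its attB) at the platform attP.

    The attP and attB sites are replaced by two hybrid junction sites that
    flank the inserted vector elements.
    """
    p = platform.index("attP")
    b = vector.index("attB")
    # Rotate the circular vector so that it starts just after attB.
    insert = vector[b + 1:] + vector[:b]
    return (platform[:p] + ["att_hybrid_left"] + insert
            + ["att_hybrid_right"] + platform[p + 1:])


def expressed_markers(construct):
    """Return the first coding element downstream of each promoter.

    att sites are skipped because they are not coding elements.
    """
    expressed = []
    for i, element in enumerate(construct):
        if element != "promoter":
            continue
        for downstream in construct[i + 1:]:
            if downstream in ATT_SITES:
                continue
            if downstream != "promoter":
                expressed.append(downstream)
            break
    return expressed


platform = ["promoter", "attP", "marker1"]          # hypothetical layout
vector = ["attB", "marker2", "target_cassette"]     # promoterless marker 2
recombinant = integrate(platform, vector)
```

In this model the free vector expresses no marker, whereas the recombinant platform places marker 2 directly downstream of the platform promoter, which is the event the selection is intended to recover. The model does not distinguish this event from a random integration next to a genomic promoter; the text relies on PCR screening for that.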

The target gene expression vectors contain a gene (target gene) for
expression from the chromosome platform. The target gene can be
expressed using various constitutive or regulated promoter systems
across various mammalian species. For the expression of multiple target
genes within the same target gene expression vector, the expression of
the multiple targets can be coordinately regulated via viral-based or
human internal ribosome entry site (IRES) elements (see, e.g., Jackson et
al. (1990) Trends Biochem. Sci. 15:477-83; Oumard et al. (2000) Mol.
Cell. Biol. 20:2755-2759). Furthermore, using IRES type elements linked
to a downstream fluorescent marker, e.g., green, red or blue fluorescent
proteins (GFP, RFP, BFP), allows for the identification of high expressing
clones from the integrated target gene expression vector.

In certain embodiments described herein, the promoterless marker
can be transcriptionally downstream of the heterologous nucleic acid,
wherein the heterologous nucleic acid encodes a heterologous protein,
and wherein the expression level of the selectable marker is
transcriptionally linked to the expression level of the heterologous protein.
In addition, the selectable marker and the heterologous nucleic acid can
be transcriptionally linked by the presence of an IRES between them. As
set forth herein, the selectable marker is selected from the group
consisting of an antibiotic resistance gene, and a detectable protein,
wherein the detectable protein is chromogenic or fluorescent.

Expression from the target gene expression vector integrated onto the
chromosome-based platform can be further enhanced using genomic
insulator/boundary elements. The incorporation of insulator sequences
into the target gene expression vector helps define boundaries in
chromatin structure and thus minimizes influence of chromatin position
effects/gene silencing on the expression of the target gene (Bell et al.
(1999) Current Opinion in Genetics and Development 9:191-198; Emery
et al. (2000) Proc. Natl. Acad. Sci. U.S.A. 97:9150-9155). Examples of
insulator elements that can be included onto the target gene expression
vector in order to optimize expression include, but are not limited to:

1) chicken β-globin HS4 element (Prioleau et al. (1999) EMBO J.
18:4035-4048);

2) matrix attachment regions (MAR; see, e.g., Ramakrishnan et
al. (2000) Mol. Cell. Biol. 20:868-877);

3) scaffold attachment regions (SAR; see, e.g., Auten et al.
(1999) Human Gene Therapy 10:1389-1399); and

4) universal chromatin opening elements (UCOE; WO/0005393
and WO/0224930).

The copy number of the target gene can be controlled by
sequentially adding multiple target gene expression vectors containing the
target gene onto multiple integration sites on the chromosome platform.
Likewise, the copy number of the target gene can be controlled within an
individual target gene expression vector by the addition of DNA
sequences that promote gene amplification. For example, gene
amplification can be induced utilizing the dihydrofolate reductase (DHFR)
minigene with subsequent selection with methotrexate (see, e.g.,
Schimke (1984) Cell 37:705-713) or amplification promoting sequences
from the rDNA locus (see, e.g., Wegner et al. (1989) Nucl. Acids Res.
17:9909-9932).

6. Platforms with other recombinase system sites

A "double lox" targeting strategy mediated by Cre-recombinase
(Bethke et al. (1997) Nucl. Acids Res. 25:2828-2834) can be used. This
strategy employs a pair of heterospecific lox sites, loxA and loxB, which
differ by one nucleotide in the 8 bp spacer region. Both sites are
engineered into the artificial chromosome and also onto the targeting DNA
vector. This allows for a direct site-specific insertion of a commercially
relevant gene or genes by a Cre-catalyzed double crossover event. In
essence, a platform ACes is engineered with a hygromycin-resistance gene
flanked by the double lox sites, generating lox-ACes, which is maintained
in the thymidine kinase deficient cell, LMtk(-). The gene of interest, for
example, for testing purposes, the green fluorescent protein gene, GFP,
and an HSV thymidine kinase gene (tk) marker, are engineered between the
appropriate lox sites of the targeting vector. The vector DNA is
cotransfected with plasmid pBS185 (Life Technologies) encoding the Cre
recombinase gene into mammalian cells maintaining the dual-lox artificial
chromosome. Transient expression of the Cre recombinase catalyzes the
site-specific insertion of the gene and the tk-gene onto the artificial
chromosome. The transfected cells are grown in HAT medium that
selects for only those cells that have integrated and expressed the
thymidine kinase gene. The HAT^R colonies are screened by PCR analyses
to identify artificial chromosomes with the desired insertion.

To generate the lox-ACes, Lambda-Hyg^R-lox DNA is transfected
into the LMtk(-) cell line harboring the precursor ACes.
Hygromycin-resistant colonies are analyzed by FISH and Southern blotting
for the presence of a single copy insert on the ACes.

To demonstrate the gene replacement technology, cell lines
containing candidate lox-ACes are cotransfected with pTK-GFP-lox and
pBS185 (encoding the Cre recombinase gene) DNA. After transfection,
transient expression of plasmid pBS185 will provide a sufficient burst of
Cre recombinase activity to catalyze DNA recombination at the lox sites.
Thus, a double crossover event between the ACes target and the
exogenous targeting plasmid carrying the loxA and loxB sites permits the
simple replacement of the hygromycin-resistance gene on the lox-ACes
with the tk-GFP cassette from the targeting plasmid, with no integration of
vector DNA. Transfected cells are grown in HAT medium to select for tk
expression. Correct targeting will result in the generation of HAT^R,
hygromycin sensitive, and green fluorescent cells. The desired integration
event is verified by Southern and PCR analyses. Specific PCR primer sets
are used to amplify DNA sequences flanking the individual loxA and loxB
sites on the lox-ACes before and after homologous recombination.
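
The double crossover and the phenotype it is expected to produce can be summarized in a small Python sketch. It is illustrative only: constructs are modeled as ordered lists of element names, the host is assumed to lack its own thymidine kinase (as for the LMtk(-) line), and the function names and the "backbone" placeholder are hypothetical.

```python
# Sketch of Cre-mediated double-lox cassette exchange and the phenotype
# expected under the selection scheme described in the text.

def exchange_cassette(target, donor, left="loxA", right="loxB"):
    """Swap the segment between the two heterospecific lox sites.

    Both constructs must carry `left` then `right`; everything between them
    on the target is replaced by what lies between them on the donor. Donor
    backbone outside the lox sites is not transferred.
    """
    tl, tr = target.index(left), target.index(right)
    dl, dr = donor.index(left), donor.index(right)
    if not (tl < tr and dl < dr):
        raise ValueError("lox sites must be in the same order on both DNAs")
    return target[:tl + 1] + donor[dl + 1:dr] + target[tr:]


def phenotype(construct):
    """Predict selectable traits from the genes present on the construct."""
    genes = set(construct)
    return {
        "grows_in_HAT": "tk" in genes,
        "hygromycin_resistant": "hyg" in genes,
        "green_fluorescent": "GFP" in genes,
    }


lox_aces = ["loxA", "hyg", "loxB"]
targeting_plasmid = ["backbone", "loxA", "tk", "GFP", "loxB", "backbone"]
exchanged = exchange_cassette(lox_aces, targeting_plasmid)
```

Before exchange the modeled lox-ACes is hygromycin resistant and cannot support growth in HAT medium; after exchange the prediction reverses, matching the HAT^R, hygromycin sensitive, green fluorescent phenotype stated above.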
D. Exemplary applications of the Platform ACes 

Platform ACes are applicable and tractable for different/optimized
cell lines. Those that include a fluorescent marker, for example, can be
purified and isolated using fluorescent activated cell sorting (FACS), and
subsequently delivered to a target cell. Those with selectable markers
provide for efficient selection and provide a growth advantage. Platform
ACes allow multiple payload delivery of donor target vectors via a
positive-selection, site-specific recombination system, and they allow for
the inclusion of additional genetic factors that improve protein production
and protein quality.

The construction and use of the platform ACes as provided for
each application may be similarly applied to other applications. Particular
descriptions are for exemplification.

1. Cellular Protein Production Platform ACes (CPP ACes)

As described herein, ACes can be produced from acrocentric
chromosomes in rodent (mouse, hamster) cell lines via megareplicator
induced amplification of heterochromatin/rDNA sequences. Such ACes
are ideal for cellular protein production as well as other applications
described herein and known to those of skill in the art. ACes platforms
that contain a plurality of recombination sites are particularly suitable for
engineering as cellular protein production systems.

In one embodiment, CPP ACes involve a two-component system:
the platform chromosome containing multiple engineering sites and the
donor target vector containing a platform-specific recombination site with
designed expression cassettes (see Figure 9).

The platform ACes can be produced from any artificial
chromosome, particularly the amplification-based artificial chromosomes.
For exemplification, they are produced from rodent artificial chromosomes
produced from acrocentric chromosomes using the technology of U.S.
Patent Nos. 6,077,697 and 6,025,155 and published International PCT
application No. WO 97/40183, in which nucleic acid is targeted to the
pericentric heterochromatin and, particularly, into rDNA to initiate the
replication event(s). The ACes can be produced directly in the chosen
cellular protein production cell lines, such as, but not limited to, CHO
cells, hybridomas, plant cells, plant tissues, plant protoplasts, stem cells
and plant calli.

a. Platform Construction

In the exemplary embodiment, the initial de novo platform
construction requires co-transfecting with excess targeting DNA, such as
rDNA or lambda DNA without an attP region, and an engineered
selectable marker. The engineered selectable marker should contain a
promoter, generally a constitutive promoter, such as a human or viral
(e.g., adenovirus or SV40) promoter, including the human ferritin heavy
chain promoter (SEQ ID NO: 128), SV40 and EF1α promoters, to control
expression of a marker gene that provides a selective growth advantage
to the cell. An example of such a marker gene is the E. coli hisD gene
(encoding histidinol dehydrogenase), which is homologous and analogous
to the S. typhimurium hisD, a dominant marker selection system for
mammalian cells previously described (see, Hartman et al. (1988) Proc.
Natl. Acad. Sci. U.S.A. 85:8047-8051). Since histidine is an essential
amino acid in mammals and a nutritional requirement in cell culture, the
E. coli hisD gene can be used to select for histidine prototrophy in defined
media. Furthermore, more stringent selection can be placed on the cells
by including histidinol in the medium. Histidinol is itself permeable and
toxic to cells. The hisD gene provides a means of detoxification.




Placed between the promoter and the marker gene is the
bacteriophage lambda attP site to use the bacteriophage lambda integrase
dependent site-specific recombination system (described herein). The
insertion of an attP site downstream of a promoter element provides
forward selection of site-specific recombination events onto the platform
ACes.

b. Donor Target Vector Construction 

A second component of the CPP platform ACes system involves
the construction of donor target vectors containing a gene product(s) of
interest for the CPP platform ACes. Individual donor target vectors can
be designed for each gene product to be expressed thus enabling
maximum usage of a de novo constructed platform ACes, so that one or
a few CPP platform ACes will be required for many gene targets.

A key feature of the donor target vector is the promoterless marker
gene containing an upstream attB site (marker 2 on Figure 9). Normally
the marker would not be expressed unless it is placed downstream of a
promoter sequence. As discussed above, using the lambda integrase
technology (λINTE174R on Figure 8 and Figure 9), site-specific
recombination between the attB site on the vector and the promoter-attP
site on the CPP platform ACes results in the expression of the donor target
vector marker, providing positive selection for the site-specific event.
Site-specific recombination events on the CPP ACes versus random
integrations next to a promoter in the genome (false positive) can be
quickly screened by designing primers to detect the correct event by PCR.
In addition, since the lambda integrase reaction is unidirectional, i.e.,
the excision reaction is not possible, a number of unique targets can be
loaded onto the CPP platform ACes, limited only by the number of markers
available.

Additional features of the donor target vector include gene target
expression cassettes flanked by either chromatin insulator regions, matrix
attachment regions (MAR) or scaffold attachment regions (SAR). The use
of these regions will provide a more "open" chromatin environment for
gene expression and help alleviate silencing. An example of such a
cassette for expressing a monoclonal antibody is described. For this
purpose, a strong constitutive promoter, e.g., chicken β-actin or RNA PolI,
is used to drive the expression of the heavy and light chain open reading
frames. The heavy and light chain sequences flank a nonattenuated
human IRES (IRESh; from the 5'UTR of the NRF1 gene; see Oumard et al.,
2000, Mol. and Cell Biol. 20(8):2755-2759) element, thereby
coordinating transcription of both heavy and light chain sequences. Distal
to the light chain open reading frame resides an additional viral encoded
IRES (IRESv; modified EMCV internal ribosomal entry site (IRES)) element
attenuating the expression of the fluorescent marker gene hrGFP from
Renilla (Stratagene). By linking the hrGFP with an attenuated IRES, the
heavy and light chains along with the hrGFP are monocistronic. Thus, the
identification of hrGFP fluorescing cells will provide a means to detect
protein producing cells. In addition, high producing cell lines can be
identified and isolated by FACS, thereby decreasing the time frame in
finding high expressers. Functional monoclonal antibody will be
confirmed by ELISA.

c. Additional components in cellular protein production
platform ACes (CPP ACes)

In addition to the aforementioned CPP ACes system, other genetic
factors can be included to enhance the yield and quality of the expressed
protein. Again, to provide maximum flexibility, these additional factors
can be inserted onto the CPP platform ACes by λINTE174R dependent
site-specific recombination. Other factors that could be used with a CPP
Platform ACes include, for example, the adenovirus E1a transactivation
system, which upregulates both cellular and viral promoters (see, e.g.,
Svensson and Akusjarvi (1984) EMBO J. 3:789-794; and US patents
5,866,359; 4,775,630 and 4,920,211).

d. Targets for CHO-ACes engineering to enhance cell
growth, such as CHO cell growth and protein
production/quality

If adding these additional factors onto the CPP ACes is not prudent
or desired, the host cell, such as a CHO cell, can be engineered to express
these factors (see, below, targets for CHO-ACes engineering to enhance CHO
cell growth and protein production/quality). Additional factors to consider
include the addition of insulin or IGF-1 to sustain viability;
human sialyltransferases or related factors to produce more human-like
glycoproteins; expression of factors to decrease ammonium accumulation
during cell growth; expression of factors to inhibit apoptosis; expression
of factors to improve protein secretion and protein folding; and expression
of factors to permit serum-free transfection and selection.

1) Addition of Insulin or IGF-1 to sustain
viability

Stimulatory factors and/or their receptors are expressed to set up
an autocrine loop, to improve cell growth, such as CHO cell growth. Two
exemplary candidates are insulin and IGF-1 (see, Biotechnol Prog 2000
Sep;16(5):693-7). Insulin is the most commonly used growth factor for
sustaining cell growth and viability in serum-free Chinese hamster ovary
(CHO) cell cultures. Insulin and an IGF-1 analog (LongR(3)) serve as growth
and viability factors for CHO cells.

CHO cells were modified to produce higher levels of essential
nutrients and factors. A serum-free (SF) medium for dihydrofolate
reductase-deficient Chinese hamster ovary cells (DG44 cells) was
prepared. Chinese hamster ovary cells (DG44 cells), which are normally
maintained in 10% serum medium, were gradually weaned to 0.5%
serum medium to increase the probability of successful growth in SF
medium (see, Kim et al. (1999) In Vitro Cell Dev Biol Anim 35(4):178-82).
A SF medium (SF-DG44) was formulated by supplementing the basal
medium with these components; basal medium was prepared by
supplementing Dulbecco's modified Eagle's medium and Ham's nutrient
mixture F12 with hypoxanthine (10 mg/l) and thymidine (10 mg/l).
Development of a SF medium for DG44 cells was facilitated using a
Plackett-Burman design technique and weaning of cells.
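
For illustration only, a Plackett-Burman screen of medium supplements can
be sketched as follows. The 12-run, 11-factor design is built from the
standard generator row by cyclic shifting, and main effects are estimated
as the difference between the mean response at the high and low levels of
each factor. The factor indices and the response values are hypothetical;
the supplements actually screened are not listed herein.

```python
# Standard generator row for the 12-run Plackett-Burman design.
GENERATOR_12 = [1, 1, -1, 1, 1, 1, -1, -1, -1, 1, -1]


def plackett_burman_12():
    """Return the 12 x 11 design matrix (+1 = supplement added, -1 = omitted)."""
    n = len(GENERATOR_12)
    # Eleven cyclic shifts of the generator row ...
    rows = [GENERATOR_12[n - s:] + GENERATOR_12[:n - s] for s in range(n)]
    # ... followed by a final run with every factor at the low level.
    rows.append([-1] * n)
    return rows


def main_effects(design, responses):
    """Mean response at +1 minus mean response at -1, for each factor."""
    half = len(design) / 2
    n_factors = len(design[0])
    return [
        sum(row[j] * y for row, y in zip(design, responses)) / half
        for j in range(n_factors)
    ]


design = plackett_burman_12()
# Hypothetical growth responses in which only factors 0 and 4 matter.
responses = [10 + 3 * row[0] - 2 * row[4] for row in design]
effects = main_effects(design, responses)
```

In a design of this kind every pair of factor columns is orthogonal, which
is what allows eleven candidate supplements to be screened for main effects
in only twelve cultures.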

2) Human sialyltransferases or related 
factors to produce more human-like 
glycoproteins 

CHO cells have been modified by increasing their ability to process
protein via addition of complex carbohydrates. This has been achieved by
overexpression of relevant processing enzymes, or in some cases,
reducing expression of relevant enzymes (see, Bragonzi et al. (2000)
Biochim Biophys Acta 1474(3):273-282; see, also Weikert et al. (1999)
Nature Biotech. 17:1116-1121; Ferrari J et al. (1998) Biotechnol Bioeng
60(5):589-95). A CHO cell line expressing alpha2,6-sialyltransferase was
developed for the production of human-like sialylated recombinant
glycoproteins. The sialylation defect of CHO cells can be corrected by
transfecting the alpha2,6-sialyltransferase (alpha2,6-ST) cDNA into the
cells. Glycoproteins produced by such CHO cells display alpha2,6- and
alpha2,3-linked terminal sialic acid residues, similar to human
glycoproteins.

As another example for improving the production of human-like
sialylated recombinant glycoproteins, a CHO cell line has been developed
that constitutively expresses sialidase antisense RNA (see, Ferrari J et al.
(1998) Biotechnol Bioeng 60(5):589-95). Several antisense expression
vectors were prepared using different regions of the sialidase gene. Co-
transfection of the antisense constructs with a vector conferring
puromycin resistance gave rise to over 40 puromycin resistant clones that
were screened for sialidase activity. A 5' 474 bp coding segment of the
sialidase cDNA, in the inverted orientation in an SV40-based expression
vector, gave maximal reduction of the sialidase activity, to about 40% of
wild-type values.

Oligosaccharide biosynthesis pathways in mammalian cells have
been engineered for generation of recombinant glycoproteins (see, e.g.,
Sburlati (1998) Biotechnol Prog 14(2):189-92, which describes a Chinese
hamster ovary (CHO) cell line capable of producing bisected
oligosaccharides on glycoproteins). This cell line was created by
overexpression of a recombinant N-acetylglucosaminyltransferase III (GnT-
III) (see, also, Prati et al. (1998) Biotechnol Bioeng 59(4):445-50, which
describes antisense strategies for glycosylation engineering of CHO cells).

3) Expression of factors to decrease 
ammonium accumulation during cell 
growth 

Excess ammonium, which is a by-product of CHO cell metabolism,
can have detrimental effects on cell growth and protein quality (see, Yang
et al. (2000) Biotechnol Bioeng 68(4):370-80). To solve this problem,
ammonium levels were modified by overexpressing carbamoyl phosphate
synthetase I and ornithine transcarbamoylase or glutamine synthetase in
CHO cells. Such modification resulted in reduced ammonium levels
and an increased growth rate (see Kim et al. (2000) J
Biotechnol 81(2-3):129-40; and Enosawa et al. (1997) Cell Transplant
6(5):537-40).

4) Expression of factors to improve protein
secretion and protein folding

Overexpression of relevant enzymes can be engineered into the
ACes to improve protein secretion and folding.

5) Expression of factors to permit serum-free 
transfection and selection 

It is advantageous to have the ability to convert CHO cells in
suspension growing in serum free medium to adherence without having
to resort to serum addition. Laminin or fibronectin addition is sufficient to
make cells adherent (see, e.g., Zaworski et al. (1993) Biotechniques
15(5):863-6) so that expressing either of these genes in CHO cells under
an inducible promoter should allow for reversible shift to adherence
without requiring serum addition.

2. Platform ACes and Gene Therapy
The platform ACes provided herein are contemplated for use in
mammalian gene therapy, particularly human gene therapy. Human ACes
can be derived from human acrocentric chromosomes from human host
cells, in which the amplified sequences are heterochromatic and/or human
rDNA. Different platform ACes applicable for different tissue cell types
are provided. The ACes for gene therapy can contain a single copy of a
therapeutic gene inserted into a defined location on platform ACes.
Therapeutic genes include genomic clones, cDNA, hybrid genes and other
combinations of sequences. Preferred selectable markers are those from
the mammalian host, such as human derived factors so that they are non-
immunogenic, non-toxic and allow for efficient selection, such as by
FACS and/or drug resistance.

Platform ACes, useful for gene therapy and other applications, as
noted herein, can be generated by megareplicator dependent
amplification, such as by the methods in U.S. Patent Nos. 6,077,697 and
6,025,155 and published International PCT application No.
WO 97/40183. In one embodiment, human ACes are produced using
human rDNA constructs that target rDNA arrays on human acrocentric
chromosomes and induce the megareplicator in human cells, particularly
in primary cell lines (with sufficient number of doublings to form the
ACes) or stem cells (such as hematopoietic stem cells, mesenchymal
stem cells, adult stem cells or embryonic stem cells) to avoid the
introduction of potentially harmful rearranged DNA sequences present in
many transformed cell lines. Megareplicator induced ACes formation can
result in multiple copies of targeting DNA/selectable markers in each
amplification block on both chromosomal arms of the platform ACes.

In view of the considerations regarding immunogenicity and
toxicity, the production of human platform ACes for gene therapy
applications employs a two component system analogous to the platform
ACes designed for cellular protein production (CPP platform ACes). The
system includes a platform chromosome of entirely human DNA origin
containing multiple engineering sites and a gene target vector carrying the
therapeutic gene of interest.

a. Platform Construction
The initial de novo construction of the platform chromosome
employs the co-transfection of excess targeting DNA and a selectable
marker. In one embodiment, the DNA is targeted to the rDNA arrays on
the human acrocentric chromosomes (chromosomes 13, 14, 15, 21 and
22). For example, two large human rDNA containing PAC clones 18714
and 18720 and the human PAC clone 558F8 are used for targeting
(Genome Research (ML) now Incyte, BACPAC Resources, 747 52nd
Street, Oakland CA). The mouse rDNA clone pFK161 (SEQ ID NO: 118),
which was used to make the human SATAC from the 94-3
hamster/human hybrid cell line (see, e.g., published International PCT
application No. WO 97/40183 and Csonka et al., Journal of Cell Science
113:3207-3216; and Example 1 for a description of pFK161) can also be
used.

For animal applications, selectable markers should be non-
immunogenic in the animal, such as a human, and include, but are not
limited to: human nerve growth factor receptor (detected with a MAb,
such as described in US patent 6,365,373); truncated human growth
factor receptor (detected with MAb); mutant human dihydrofolate
reductase (DHFR; fluorescent MTX substrate available); secreted alkaline
phosphatase (SEAP; fluorescent substrate available); human thymidylate
synthase (TS; confers resistance to anti-cancer agent fluorodeoxyuridine);
human glutathione S-transferase alpha (GSTA1; conjugates glutathione to
the stem cell selective alkylator busulfan; chemoprotective selectable
marker in CD34+ cells); CD24 cell surface antigen in hematopoietic stem
cells; human CAD gene to confer resistance to N-phosphonacetyl-L-
aspartate (PALA); human multi-drug resistance-1 (MDR-1; P-glycoprotein
surface protein selectable by increased drug resistance or enriched by
FACS); human CD25 (IL-2ar; detectable by MAb-FITC); methylguanine-
DNA methyltransferase (MGMT; selectable by carmustine); and cytidine
deaminase (CD; selectable by Ara-C).

Since megareplicator induced amplification generates multiple
copies of the selectable marker, a second consideration for the selection
of the human marker is the resulting dose of the expressed marker after
ACes formation. A high level of expression of certain markers may be
detrimental to the cell and/or result in autoimmunity. One method to
decrease the dose of the marker protein is by shortening its half-life, such
as via the fusion of the well-conserved human ubiquitin tag (a 76 amino
acid sequence), thus leading to increased turnover of the selectable
marker. This has been used successfully for a number of reporter
systems including DHFR (see, e.g., Stack et al. (2000) Nature
Biotechnology 18:1298-1302 and references cited therein).
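
The relationship between marker copy number, half-life and dose can be
illustrated with a simple calculation. The sketch below assumes constant
synthesis per gene copy and first-order degradation, so that the
steady-state level equals the total synthesis rate divided by the
degradation rate constant ln(2)/half-life. The model, the function name
and all numerical values are illustrative assumptions only; dilution by
cell division and any copy-dependent silencing are ignored.

```python
import math


def steady_state_level(copies, synthesis_per_copy, half_life_h):
    """Steady-state marker level under zero-order synthesis and
    first-order decay: level = copies * synthesis / k_deg."""
    if half_life_h <= 0:
        raise ValueError("half_life_h must be positive")
    k_deg = math.log(2) / half_life_h  # degradation rate constant, per hour
    return copies * synthesis_per_copy / k_deg


# Hypothetical values: 20 marker copies, 100 molecules/hour/copy.
untagged = steady_state_level(20, 100.0, half_life_h=24.0)
tagged = steady_state_level(20, 100.0, half_life_h=2.0)
fold_reduction = untagged / tagged
```

Under these assumptions the steady-state dose scales linearly with
half-life, so shortening the half-life from 24 hours to 2 hours lowers the
dose twelve-fold without changing the number of marker copies.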

Using the ubiquitin tagged protein, a human selectable marker
system analogous to the CPP ACes described herein is constructed.
Briefly, a tagged selectable marker, such as for example one of those
described herein, is cloned downstream of an attP site and expressed
from a human promoter. Exemplary promoters contemplated for use
herein include, but are not limited to, the human ferritin heavy chain
promoter (SEQ ID NO: 128); RNA Poll; EF1a; TR; glyceraldehyde-3-
phosphate dehydrogenase core promoter (GAP); a GAP core promoter
including a proximal insulin inducible element and the intervening GAP
sequence; phosphofructokinase promoter; and phosphoglycerate kinase
promoter. Also contemplated herein is an aldolase A promoter H1 & H2
(representing closely spaced transcriptional start sites) along with the
proximal H enhancer. There are 4 promoters (e.g., transcriptional start
sites) for this gene, each having different regulatory and tissue activity.
The H (most proximal 2) promoters are ubiquitously expressed off the H
enhancer. This resulting marker can then be co-transfected along with
excess human rDNA targeting sequence into the host cells. An important
criterion for the selection of the
recipient cells is a sufficient number of cell doublings for the formation and
detection of ACes. Accordingly, the co-transfections should be
attempted in human primary cells that can be cultured for long periods of
time, such as for example, stem cells (e.g., hematopoietic, mesenchymal,
adult or embryonic stem cells), or the like. Additional cell types include,
but are not limited to: single gene transfected cells exhibiting increased
life-span; over-expressing c-myc cells, e.g. MSU1.1 (Morgan et al., 1991,
Exp. Cell Res., Nov;197(1):125-136); over-expressing telomerase lines,
such as TERT cells; SV40 large T-antigen transfected lines; tumor cell
lines, such as HT1080; and hybrid human cell lines, such as the 94-3
hamster/human hybrid cell line.

b. Gene Target Vector
The second component of the GT platform ACes (GT ACes) system
involves the use of engineered target vectors carrying the therapeutic
gene of interest. These are introduced onto the GT platform ACes via
site-specific recombination. As with the CPP ACes, the use of engineered
target vectors maximizes the use of the de novo generated GT platform
ACes for most gene targets. Furthermore, using lambda integrase
technology, GT platform ACes containing multiple attP sites provide the
opportunity to incorporate multiple therapeutic targets onto a single
platform. This could be of value in cases where a defined therapy
requires multiple gene targets, a single therapeutic target requires an
additional gene regulatory factor or a GT ACes requires a "kill" switch.

Similar to the CPP ACes, a feature of the gene target vector is the
promoterless marker gene containing an upstream attB site (marker 2 on
Figure 9). Normally, the marker (in this case, a cell surface antigen that
can be sorted by FACS would be ideal) would not be expressed unless it
is placed downstream of a promoter sequence. Using the lambda
integrase technology (λINTE174R, Figure 9), site-specific recombination
between the attB site on the vector and the promoter-attP site on the GT
platform ACes results in the expression of marker 2 on the gene target
vector, i.e. positive selection for the site-specific event. Site-specific
recombination events on the GT ACes versus random integrations next to
a promoter in the genome (false positive) can be quickly screened by
designing primers to detect the correct event by PCR.
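
The PCR screen can be pictured with a short in-silico check. In the sketch
below, a forward primer anneals in the platform promoter and a reverse
primer anneals in the marker, so a product is predicted only when the
promoter and the marker lie on the same molecule, as after the
site-specific event. All sequences, including the recombination junction,
are arbitrary placeholders chosen for illustration and do not correspond
to any sequence provided herein; exact-match annealing is likewise a
simplifying assumption.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def amplicon_length(template, fwd_primer, rev_primer):
    """Predicted product length on the top strand, or None if the primers
    do not both anneal in the orientation needed for amplification."""
    start = template.find(fwd_primer)
    if start == -1:
        return None
    # The reverse primer binds the top strand as its reverse complement.
    end = template.find(revcomp(rev_primer), start + len(fwd_primer))
    if end == -1:
        return None
    return end + len(rev_primer) - start


# Placeholder sequences (hypothetical, for illustration only).
promoter = "TTGACAGCTAGCTCAGTCCT"      # platform ACes promoter
junction = "ACGTACGTACGTACG"           # stand-in for the recombinant att site
marker = "ATGGCTAGCAAAGGAGAAGA"        # promoterless marker 2
genome_flank = "CCGGTTAACCGGTTAACCGG"  # unrelated genomic promoter region

fwd = promoter[:10]
rev = revcomp(marker[-10:])

correct_event = promoter + junction + marker
random_integration = genome_flank + marker

site_specific = amplicon_length(correct_event, fwd, rev)
false_positive = amplicon_length(random_integration, fwd, rev)
```

A random integration next to a genomic promoter lacks the platform promoter
sequence, so the forward primer finds no site and no product is predicted.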

For expression of the therapeutic gene, human specific promoters,
such as a ferritin heavy chain promoter (SEQ ID NO: 128); EF1a or RNA
Poll, are used. These promoters are for high level expression of a cDNA
encoded therapeutic protein. In addition to expressing cDNA (or even
hybrid cDNA/artificial intron constructs), the GT platform ACes are used
for engineering and expressing large genomic fragments carrying
therapeutic genes of interest expressed from native promoter sequences.
This is of importance in situations where the therapy requires precise cell
specific expression or in instances where expression is best achieved
from genomic clones versus cDNA.

3. Selectable markers for use, for example, in Gene
Therapy (GT)

The following are selectable markers that can be incorporated into 
human ACes and used for selection. 

Dual Resistance to 4-Hydroperoxycyclophosphamide
and Methotrexate by Retroviral Transfer of the Human
Aldehyde Dehydrogenase Class 1 Gene and a Mutated
Dihydrofolate Reductase Gene

The genetic transfer of drug resistance to hematopoietic cells is one
approach to overcoming myelosuppression caused by high-dose
chemotherapy. Because cyclophosphamide (CTX) and methotrexate
(MTX) are commonly used non-cross-resistant drugs, generation of dual
drug resistance in hematopoietic cells that allows dose intensification may
increase anti-tumor effects and circumvent the emergence of drug-
resistant tumors. A retroviral vector containing a human cytosolic ALDH-
1-encoding DNA clone and a human doubly mutated DHFR-encoding
clone (Phe22/Ser31; termed F/S in the description of constructs) to
generate increased resistance to CTX and MTX was constructed (Takebe
et al. (2001) Mol Ther 3(1):88-96). This construct may be useful for
protecting patients from high-dose CTX- and MTX-induced
myelosuppression. ACes can be similarly constructed.

Multiple mechanisms of N-phosphonacetyl-L-aspartate
resistance in human cell lines: carbamyl-P
synthetase/aspartate transcarbamylase/dihydro-orotase
gene amplification is frequent only when chromosome
2 is rearranged

Rodent cells resistant to N-phosphonacetyl-L-aspartate (PALA)
invariably contain amplified carbamyl-P synthetase/aspartate
transcarbamylase/dihydro-orotase (CAD) genes, usually in widely spaced
tandem arrays present as extensions of the same chromosome arm that
carries a single copy of CAD in normal cells (Smith et al. (1997) Proc.
Natl. Acad. Sci. U.S.A. 94:1816-21). In contrast, amplification of CAD is
very infrequent in several human tumor cell lines. Cell lines with minimal
chromosomal rearrangement and with unrearranged copies of
chromosome 2 rarely develop intrachromosomal amplifications of CAD.
These cells frequently become resistant to PALA through a mechanism
that increases the aspartate transcarbamylase activity with no increase in
CAD copy number, or they obtain one extra copy of CAD by forming an
isochromosome 2p or by retaining an extra copy of chromosome 2. In
cells with multiple chromosomal aberrations and rearranged copies of
chromosome 2, amplification of CAD as tandem arrays from rearranged
chromosomes is the most frequent mechanism of PALA resistance. All of
these different mechanisms of PALA resistance are blocked in normal
human fibroblasts. Thus, ACes with multiple copies of the CAD gene
would provide PALA resistance.

Retroviral coexpression of thymidylate synthase and
dihydrofolate reductase confers fluoropyrimidine and
antifolate resistance

Retroviral gene transfer of dominant selectable markers into
hematopoietic cells can be used to select genetically modified cells in vivo
or to attenuate the toxic effects of chemotherapeutic agents. Fantz et al.
((1998) Biochem Biophys Res Comm 243(1):6-12) have shown that
retroviral gene transfer of thymidylate synthase (TS) confers resistance to
TS directed anticancer agents and that co-expression of TS and
dihydrofolate reductase (DHFR) confers resistance to TS and DHFR
cytotoxic agents. Retroviral vectors encoding Escherichia coli TS, human
TS, and the Tyr-to-His at residue 33 variant of human TS (Y33HhTS)
were constructed, and in transfected fibroblasts these vectors conferred
comparable resistance to the TS-directed agent fluorodeoxyuridine
(FdUrd, approximately 4-fold). Retroviral vectors that encode dual
expression of Y33HhTS and the human L22Y DHFR (L22YhDHFR)
variants conferred resistance to FdUrd (3- to 5-fold) and trimetrexate (30-
to 140-fold). A L22YhDHFR-Y33HhTS chimeric retroviral vector was also
constructed and transduced cells were resistant to FdUrd (3-fold), AG337
(3-fold), trimetrexate (100-fold) and methotrexate (5-fold). These results
show that recombinant retroviruses can be used to transfer the cDNA
that encodes TS and DHFR and dual expression in transduced cells is
sufficiently high to confer resistance to TS and DHFR directed anticancer
agents. ACes can be similarly constructed.
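
Fold-resistance values of the kind quoted above are commonly expressed as
the ratio of the drug concentration giving 50% growth inhibition (IC50) in
marker-carrying cells to that in control cells. The sketch below assumes
that definition; the cited study may have used a different measure, and
the IC50 values shown are hypothetical.

```python
def fold_resistance(ic50_modified, ic50_control):
    """Ratio of IC50 in marker-expressing cells to IC50 in control cells.

    Both values must be in the same concentration units.
    """
    if ic50_control <= 0 or ic50_modified <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_modified / ic50_control


# Hypothetical IC50 values (nM) for a TS-directed agent.
control_ic50 = 10.0
transduced_ic50 = 40.0
ratio = fold_resistance(transduced_ic50, control_ic50)
```

A ratio near 1 would indicate that the marker confers no protection, which
gives a simple acceptance criterion when comparing ACes carrying different
marker copy numbers.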

Human CD34+ cells do not express glutathione S-
transferases alpha

The expression of glutathione S-transferases alpha (GST alpha) in
human hematopoietic CD34+ cells and bone marrow was studied using
RT-PCR and immunoblotting (Czerwinski M, Kiem et al. (1997) Gene Ther
4(3):268-70). The GSTA1 protein conjugates glutathione to the stem cell
selective alkylator busulfan. This reaction is the major pathway of
elimination of the compound from the human body. Human hematopoietic
CD34+ cells and bone marrow do not express GSTA1 message, which
was present at a high level in liver, an organ relatively resistant to
busulfan toxicity in comparison to bone marrow. Similarly, baboon
CD34+ cells and dog bone marrow do not express GSTA1. Thus, human
GSTA1 is a chemoprotective selectable marker in human stem cell gene
therapy and could be employed in ACes construction.

Selection of retrovirally transduced hematopoietic cells 
using CD24 as a marker of gene transfer 

Pawliuk et al. ((1994) Blood 84(9):2868-2877) have investigated
the use of a cell surface antigen as a dominant selectable marker to
facilitate the detection and selection of retrovirally infected target cells.
The small coding region of the human cell surface antigen CD24
(approximately 240 bp) was introduced into a myeloproliferative sarcoma
virus (MPSV)-based retroviral vector, which was then used to infect day 4
5-fluorouracil (5-FU)-treated murine bone marrow cells. Within 48 hours
of termination of the infection procedure CD24-expressing cells were
selected by fluorescent-activated cell sorting (FACS) with an antibody
directed against the CD24 antigen. Functional analysis of these cells
showed that they included not only in vitro clonogenic progenitors and
day 12 colony-forming unit-spleen but also cells capable of competitive
long-term hematopoietic repopulation. Double-antibody labeling studies
performed on recipients of retrovirally transduced marrow cells showed
that some granulocytes, macrophages, erythrocytes, and, to a lesser
extent, B and T lymphocytes still expressed the transduced CD24 gene at
high levels 4 months later. No gross abnormalities in hematopoiesis were
detected in mice repopulated with CD24-expressing cells. These results
show that the use of the CD24 cell surface antigen as a retrovirally
encoded marker permits rapid, efficient, and nontoxic selection in vitro of
infected primary cells, facilitates tracking and phenotyping of their
progeny, and provides a tool to identify elements that regulate the
expression of transduced genes in the most primitive hematopoietic cells.
ACes could be similarly constructed.

DeltahGHR, a biosafe cell surface-labeling molecule for
analysis and selection of genetically transduced human
cells

A selectable marker for retroviral transduction and selection of
human and murine cells is known (see, Garcia-Ortiz et al. (2000) Hum
Gene Ther 11(2):333-46). The molecule expressed on the cell surface of
the transduced population is a truncated version of human growth
hormone receptor (deltahGHR), capable of ligand (hGH) binding, but
devoid of the domains involved in signal triggering. The engineered
molecule is stably expressed in the target cells as an inert protein unable
to trigger proliferation or to rescue the cells from apoptosis after ligand
binding. This new marker has a wide application spectrum, since hGHR
in the human adult is highly expressed only in liver cells, and lower levels
have been reported in certain lymphocyte cell populations. The
deltahGHR label has high biosafety potential, as it belongs to a well-
characterized hormonal system that is nonessential in adults, and there is
extensive clinical experience with hGH administration in humans. The
differential binding properties of several monoclonal antibodies (MAbs) are
used in a cell rescue method in which the antibody used to select
deltahGHR-transduced cells is eluted by competition with hGH or,
alternatively, biotinylated hGH is used to capture tagged cells. In the latter
system, the final purified population is recovered free of attached
antibodies in hGH (a substance approved for human use)-containing
medium. Such a system could be used to identify ACes containing cells.

4. Transgenic models for evaluation of genes and
discovery of new traits in plants

Of interest is the use of plants and plant cells containing artificial
chromosomes for the evaluation of new genetic combinations and
discovery of new traits. Artificial chromosomes, by virtue of the fact that
they can contain significant amounts of DNA, can also therefore encode
numerous genes and accordingly a multiplicity of traits. It is
contemplated here that artificial chromosomes, when formed from one
plant species, can be evaluated in a second plant species. The resultant
phenotypic changes observed, for example, can indicate the nature of the
genes contained within the DNA contained within the artificial
chromosome, and hence permit the identification of novel genetic
activities. Artificial chromosomes containing euchromatic DNA or partially
containing euchromatic DNA can serve as a valuable source of new traits
when transferred to an alien plant cell environment. For example, it is
contemplated that artificial chromosomes derived from dicot plant species
can be introduced into monocot plant species by transferring a dicot
artificial chromosome possessing a
region of euchromatic DNA containing expressed genes.

The artificial chromosomes can be designed to allow the artificial
chromosome to recombine with the naturally occurring plant DNA in such
a fashion that a large region of naturally occurring plant DNA becomes
incorporated into the artificial chromosome. This allows the artificial
chromosome to contain new genetic activities and hence carry novel
traits. For example, an artificial chromosome can be introduced into a
wild relative of a crop plant under conditions whereby a portion of the
DNA present in the chromosomes of the wild relative is transferred to the
artificial chromosome. After isolation of the artificial chromosome, this
naturally occurring region of DNA from the wild relative, now located on
the artificial chromosome, can be introduced into the domesticated crop
species and the genes encoded within the transferred DNA expressed and
evaluated for utility. New traits and gene systems can be discovered in
this fashion. The artificial chromosome can be modified to contain
sequences that promote homologous recombination within plant cells, or
be modified to contain a genetic system that functions as a site-specific
recombination system.

Artificial chromosomes modified to recombine with plant DNA offer
many advantages for the discovery and evaluation of traits in different
plant species. When the artificial chromosome containing DNA from one
plant species is introduced into a new plant species, new traits and genes
can be introduced. This use of an artificial chromosome makes it possible
to overcome the sexual barrier that prevents transfer of genes from
one plant species to another species. Using artificial chromosomes in this
fashion allows for many potentially valuable traits to be identified
including traits that are typically found in wild species. Other valuable
applications for artificial chromosomes include the ability to transfer large
regions of DNA from one plant species to another, such as DNA encoding
potentially valuable traits such as altered oil, carbohydrate or protein
composition, multiple genes encoding enzymes capable of producing
valuable plant secondary metabolites, genetic systems encoding valuable
agronomic traits such as disease and insect resistance, genes encoding
functions that allow association with soil bacteria such as growth
promoting bacteria or nitrogen fixing bacteria, or genes encoding traits
that confer freezing, drought or other stress tolerances. In this fashion,
artificial chromosomes can be used to discover regions of plant DNA that
encode valuable traits.

The artificial chromosome can also be designed to allow the
transfer and subsequent incorporation of these valuable traits now located
on the artificial chromosome into the natural chromosomes of a plant
species. In this fashion the artificial chromosomes can be used to
transfer large regions of DNA encoding traits normally found in one plant
species into another plant species. In this fashion, it is possible to derive
a plant cell that no longer needs to carry an artificial chromosome to
possess the novel traits. Thus, the artificial chromosome would serve as
the transfer mechanism to permit the formation of plants with a greater
degree of genetic diversity.

The design of an artificial chromosome to accomplish the afore-
mentioned purposes can include within the artificial chromosome the
presence of specific DNA sequences capable of acting as sites for
homologous recombination to take place. For example, the DNA
sequence of Arabidopsis is now known. To construct an artificial
chromosome capable of recombining with a specific region of Arabidopsis
DNA, a sequence of Arabidopsis DNA, normally located near a
chromosomal location encoding genes of potential interest, can be
introduced into an artificial chromosome by methods provided herein. It
may be desirable to include a second region of DNA within the artificial
chromosome that provides a second flanking sequence to the region
encoding genes of potential interest, to promote a double recombination
event which would ensure transfer of the entire chromosomal region,
encoding genes of potential interest, to the artificial chromosome. The
modified artificial chromosome, containing the DNA sequences capable of
homologous recombination, can then be introduced into
Arabidopsis cells and the homologous recombination event selected.
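
The role of the two flanking sequences can be pictured with a simple
string model, offered here only as a sketch. The region of the plant
chromosome lying between two homology arms is copied into the artificial
chromosome in place of whatever lay between the matching arms there. The
labels are symbolic placeholders and not actual Arabidopsis sequences, and
the model ignores recombination frequency and arm length.

```python
def double_crossover(plant_dna, artificial_chromosome, left_arm, right_arm):
    """Return the artificial chromosome after a double recombination event,
    or None if either molecule lacks both arms in the required order."""
    p_left = plant_dna.find(left_arm)
    a_left = artificial_chromosome.find(left_arm)
    if p_left == -1 or a_left == -1:
        return None
    p_right = plant_dna.find(right_arm, p_left + len(left_arm))
    a_right = artificial_chromosome.find(right_arm, a_left + len(left_arm))
    if p_right == -1 or a_right == -1:
        return None
    # Segment of plant DNA captured between the two homology arms.
    captured = plant_dna[p_left + len(left_arm):p_right]
    return (artificial_chromosome[:a_left + len(left_arm)]
            + captured
            + artificial_chromosome[a_right:])


# Symbolic placeholders (hypothetical).
plant = "..[LEFT]geneA-geneB-geneC[RIGHT].."
aces = "ACes:[LEFT]silent-marker[RIGHT]:ACes"
recombinant = double_crossover(plant, aces, "[LEFT]", "[RIGHT]")
```

With only one arm present the function returns None, reflecting the point
made above that the second flanking sequence is what bounds the transferred
region on both sides.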

It is convenient to include a marker gene to allow for the selection
of a homologous recombination event. The marker gene is preferably
inactive unless activated by an appropriate homologous recombination
event. For example, US 5,272,071 describes a method where an
inactive plant gene is activated by a recombination event such that
desired homologous recombination events can be easily scored. Similarly,
US 5,501,967 describes a method for the selection of homologous
recombination events by activation of a silent selection gene first
introduced into the plant DNA, the gene being activated by an appropriate
homologous recombination event. Both of these methods can be applied
to enable a selective process to be included to select for recombination
between an artificial chromosome and a plant chromosome. Once the
homologous recombination event is detected and selected, the artificial
chromosome is isolated and introduced into a recipient cell, for example,
tobacco, corn, wheat or rice, and the expression of the newly introduced
DNA sequences evaluated.

Phenotypic changes in the recipient plant cells containing the
artificial chromosome, or in regenerated plants containing the artificial
chromosome, allow for the evaluation of the nature of the traits encoded
by the Arabidopsis DNA, under conditions naturally found in plant cells,
including the naturally occurring arrangement of DNA sequences
responsible for the developmental control of the traits in the normal
chromosomal environment.

Traits such as durable fungal or bacterial disease resistance, new
oil and carbohydrate compositions, valuable secondary metabolites such
as phytosterols and flavonoids, efficient nitrogen fixation or mineral
utilization, and resistance to extremes of drought, heat or cold are all found
within different populations of plant species and are often governed by
multiple genes. The use of single gene transformation technologies does
not permit the evaluation of the multiplicity of genes controlling many
valuable traits. Thus, incorporation of these genes into artificial
chromosomes allows the rapid evaluation of the utility of these genetic
combinations in heterologous plant species.

The large scale order and structure of the artificial chromosome
provides a number of unique advantages in screening for new utilities or
novel phenotypes within heterologous plant species. The new
DNA carried by an artificial chromosome can amount to millions of
base pairs, representing potentially numerous genes that may
have novel utility in a heterologous plant cell. Because the artificial
chromosome is a "natural" environment for gene expression, the problems of
variable gene expression and silencing seen for genes transferred by random
insertion into a genome should not be observed. Similarly, there is no
need to engineer the genes for expression, and the genes inserted would
not need to be recombinant genes. Thus, one expects the expression
from the transferred genes to follow the temporal and spatial pattern
observed in the species from which the genes were initially isolated. A
valuable feature for these utilities is the ability to isolate the artificial
chromosomes and to further isolate, manipulate and introduce into other
cells artificial chromosomes carrying unique genetic compositions.

Thus, artificial chromosomes and homologous
recombination in plant cells can be used to isolate and identify many
valuable crop traits.

In addition to the use of artificial chromosomes for the isolation and
testing of large regions of naturally occurring DNA, methods for the use
of artificial chromosomes and cloned DNA are also contemplated. As
described above, artificial chromosomes can be used to carry large
regions of cloned DNA, including that derived from other plant species.

The ability to incorporate novel DNA elements into an artificial
chromosome as it is being formed allows for the development of artificial
chromosomes specifically engineered as a platform for testing of new
genetic combinations, or "genomic" discoveries for model species such as
Arabidopsis. It is known that specific "recombinase" systems can be
used in plant cells to excise or re-arrange genes. These same systems
can be used to derive new gene combinations contained on an artificial
chromosome.

The artificial chromosomes can be engineered as platforms to
accept large regions of cloned DNA, such as that contained in Bacterial
Artificial Chromosomes (BACs) or Yeast Artificial Chromosomes (YACs).
It is further contemplated that, as a result of the typical structure of
artificial chromosomes containing tandemly repeated DNA blocks,
sequences other than cloned DNA sequence can be introduced by
recombination processes. In particular, recombination within a predefined
region of the tandemly repeated DNA within the artificial chromosome
provides a mechanism to "stack" numerous regions of cloned DNA,
including large regions of DNA contained within BAC or YAC clones.
Thus, multiple combinations of genes can be introduced onto artificial
chromosomes and these combinations tested for functionality. In
particular, it is contemplated that multiple YACs or BACs can be stacked
onto an artificial chromosome, the BACs or YACs containing multiple
genes of complex pathways or multiple genetic pathways. The BACs or
YACs are typically selected based on genetic information available within
the public domain, for example from the Arabidopsis Information
Management System (http://aims.cps.msu.edu/aims/index.html) or the
information related to the plant DNA sequences available from the
Institute for Genomic Research (http://www.tigr.org) and other sites
known to those skilled in the art. Alternatively, clones can be chosen at
random and evaluated for functionality. It is contemplated that
combinations providing a desired phenotype can be identified by isolation
of the artificial chromosome containing the combination and analyzing the
nature of the inserted cloned DNA.

In this regard, it is contemplated that the use of site-specific
recombination sequences can have considerable utility in developing
artificial chromosomes containing DNA sequences recognized by
recombinase enzymes and capable of accepting DNA sequences
containing same. The use of site-specific recombination as a means to
target an introduced DNA to a specific locus has been demonstrated in
the art and such methods can be employed. The recombinase systems
can also be used to transfer the cloned DNA regions contained within the
artificial chromosome to the naturally occurring plant or mammalian
chromosomes.

As noted herein, many site-specific recombinases are known and
can be identified (Kilby et al. (1993) Trends in Genetics 9:413-418). The
three recombinase systems that have been extensively employed include:
an activity identified as R encoded by the pSR1 plasmid of
Zygosaccharomyces rouxii, FLP encoded by the 2 µm circular plasmid from
Saccharomyces cerevisiae and Cre-lox from the phage P1.

The integration function of site-specific recombinases is
contemplated as a means to assist in the derivation of genetic
combinations on artificial chromosomes. In order to accomplish this, it is
contemplated that a first step of introducing site-specific recombinase
sites into the genome of a plant cell in an essentially random manner is
conducted, such that the plant cell has one or more site-specific
recombinase recognition sequences on one or more of the plant
chromosomes. An artificial chromosome is then introduced into the plant
cell, the artificial chromosome engineered to contain a recombinase
recognition site (e.g., integration site) capable of being recognized by a
site-specific recombinase. Optionally, a gene encoding a recombinase
enzyme is also included, preferably under the control of an inducible
promoter. Expression of the site-specific recombinase enzyme in the
plant cell, either by induction of an inducible recombinase gene or by
transient expression of a recombinase sequence, causes a site-specific
recombination event to take place, leading to the insertion of a region of
the plant chromosomal DNA (containing the recombinase recognition site)
into the recombinase recognition site of the artificial chromosome, and
forming an artificial chromosome containing plant chromosomal DNA.
The artificial chromosome can be isolated and introduced into a
heterologous host, preferably a plant host, and expression of the newly
introduced plant chromosomal DNA can be monitored and evaluated for
desirable phenotypic changes. Accordingly, carrying out this
recombination with a population of plant cells wherein the chromosomally
located recombinase recognition site is randomly scattered throughout the
chromosomes of the plant can lead to the formation of a population of
artificial chromosomes, each with a different region of plant chromosomal
DNA, and each potentially representing a novel genetic combination.

This method requires the precise site-specific insertion of
chromosomal DNA into the artificial chromosome. This precision has
been demonstrated in the art. For example, Fukushige and Sauer ((1992)
Proc. Natl. Acad. Sci. USA, 89:7905-7909) demonstrated that the Cre-lox
homologous recombination system could be successfully employed to
introduce DNA into a predefined locus in a chromosome of mammalian
cells. In this demonstration a promoter-less antibiotic resistance gene
modified to include a lox sequence at the 5' end of the coding region was
introduced into CHO cells. Cells were re-transformed by electroporation
with a plasmid that contained a promoter with a lox sequence and a
transiently expressed Cre recombinase gene. Under the conditions
employed, the expression of the Cre enzyme catalyzed the homologous
recombination between the lox site in the chromosomally located
promoter-less antibiotic resistance gene and the lox site in the introduced
promoter sequence, leading to the formation of a functional antibiotic
resistance gene. The authors demonstrated efficient and correct targeting
of the introduced sequence: 54 of 56 lines analyzed corresponded to the
predicted single copy insertion of the DNA due to Cre catalyzed
site-specific homologous recombination between the lox sequences.
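
As a quick arithmetic check on the figures quoted for Fukushige and Sauer, the following Python sketch computes the fraction of analyzed lines that showed the predicted single copy insertion. The function name is illustrative only and is not taken from the cited work.

```python
from fractions import Fraction


def targeting_fraction(correct_lines: int, analyzed_lines: int) -> Fraction:
    """Return the fraction of analyzed lines carrying the predicted insertion."""
    if analyzed_lines <= 0 or not 0 <= correct_lines <= analyzed_lines:
        raise ValueError("require analyzed > 0 and 0 <= correct <= analyzed")
    return Fraction(correct_lines, analyzed_lines)


# Counts quoted in the text: 54 of 56 lines analyzed.
fraction = targeting_fraction(54, 56)
percent = round(float(fraction) * 100, 1)
print(f"{fraction} of lines, about {percent}%")
```

With the quoted counts this reduces to 27/28, or roughly 96% of the lines analyzed.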

Accordingly, a lox sequence may be first added to a genome of a
plant species capable of being transformed and regenerated to a whole
plant to serve as a recombinase target DNA sequence for recombination
with an artificial chromosome. The lox sequence may be optionally
modified to further contain a selectable marker which is inactive but can
be activated by insertion of the lox recombinase recognition sequence into
the artificial chromosome.

A promoterless marker gene or selectable marker gene linked to the
recombinase recognition sequence, which is first inserted into the
chromosomes of a plant cell, can be used to engineer a platform
chromosome. A promoter is linked to a recombinase recognition site, in
an orientation that allows the promoter to control the expression of the
marker or selectable marker gene upon recombination within the artificial
chromosome. Upon a site-specific recombination event between a
recombinase recognition site in a plant chromosome and the recombinase
recognition site within the introduced artificial chromosome, a cell is
derived with a recombined artificial chromosome, the artificial
chromosome containing an active marker or selectable marker activity
that permits the identification and/or selection of the cell.
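
The selection logic described above can be sketched as a toy model in Python. This is only an illustration under simplifying assumptions: each DNA molecule is a linear list of named elements, recombination is modeled as a reciprocal exchange of the sequences downstream of a shared recognition site, and a marker counts as active only when a promoter lies immediately upstream of it (ignoring the recognition site itself). All element names are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Element:
    kind: str  # "promoter", "site", "marker" or "other"
    name: str


def site_index(molecule: List[Element], site_name: str) -> int:
    """Locate the first recognition site with the given name."""
    for i, element in enumerate(molecule):
        if element.kind == "site" and element.name == site_name:
            return i
    raise ValueError(f"no {site_name} site found")


def recombine(a: List[Element], b: List[Element],
              site_name: str) -> Tuple[List[Element], List[Element]]:
    """Reciprocal exchange of the segments downstream of the shared site."""
    i, j = site_index(a, site_name), site_index(b, site_name)
    return a[:i + 1] + b[j + 1:], b[:j + 1] + a[i + 1:]


def marker_is_active(molecule: List[Element]) -> bool:
    """True when a promoter sits directly upstream of a marker gene."""
    no_sites = [e for e in molecule if e.kind != "site"]
    return any(prev.kind == "promoter" and cur.kind == "marker"
               for prev, cur in zip(no_sites, no_sites[1:]))


# Artificial chromosome: promoter followed by a recognition site.
artificial = [Element("other", "centromere"), Element("promoter", "P"),
              Element("site", "lox")]
# Plant chromosome: recognition site followed by a promoterless marker.
plant = [Element("other", "plant_left"), Element("site", "lox"),
         Element("marker", "selectable_marker"),
         Element("other", "plant_region")]

new_artificial, new_plant = recombine(artificial, plant, "lox")
print(marker_is_active(plant), marker_is_active(new_artificial))
```

In this model the marker is silent on the plant chromosome and becomes active only on the recombined artificial chromosome, which also acquires the adjacent plant region.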

The artificial chromosomes can be transferred to other plant or
animal species and the functionality of the new combinations tested. The
ability to conduct such an inter-chromosomal transfer of sequences has
been demonstrated in the art. For example, the use of the Cre-lox
recombinase system to cause a chromosome recombination event
between two chromatids of different chromosomes has been shown.

Any number of recombination systems may be employed as
described herein, such as, but not limited to, bacterially derived systems
such as the att/int system of phage lambda, and the Gin/gix system.

More than one recombination system may be employed, including,
for example, one recombinase system for the introduction of DNA into an
artificial chromosome, and a second recombinase system for the
subsequent transfer of the newly introduced DNA contained within an
artificial chromosome into the naturally occurring chromosome of a
second plant species. The choice of the specific recombination system
used will be dependent on the nature of the modification contemplated.

The ability to isolate an artificial chromosome, in
particular an artificial chromosome containing plant chromosomal DNA
introduced via site-specific recombination, and to re-introduce the
chromosome into other mammalian or plant cells, particularly plant cells,
allows these new combinations to be evaluated in different crop species
without the need to first isolate and modify the genes, or to carry out
multiple transformations or gene transfers to assemble and test the same
combinations of genes in plants.
The use of a site-specific recombinase also allows the convenient
recovery of the plant chromosomal region into other recombinant DNA
vectors and systems, such as mammalian or insect systems, for
manipulation and study.

Also contemplated herein are ACes, cell lines and methods for use
in screening new chromosomal combinations, deletions and truncations
within a eukaryotic genome that take advantage of the site-specific
recombination systems incorporated onto platform ACes provided herein.
For example, provided herein is a cell line useful for making a library of
ACes, comprising a multiplicity of heterologous recombination sites
randomly integrated throughout the endogenous chromosomes. Also
provided herein is a method of making a library of ACes comprising
random portions of a genome, comprising introducing one or more ACes
into a cell line comprising a multiplicity of heterologous recombination
sites randomly integrated throughout the endogenous chromosomes,
under conditions that promote the site-specific chromosomal arm
exchange of the ACes into, and out of, a multiplicity of the heterologous
recombination sites within the cell's chromosomal DNA; and isolating said
multiplicity of ACes, thereby producing a library of ACes whereby multiple
ACes have different portions of the genome within. Also provided herein
is a library of cells useful for genomic screening, said library comprising a
multiplicity of cells, wherein each cell comprises an ACes having a
mutually exclusive portion of a chromosomal nucleic acid therein. The
library of cells can be from a different species and/or cell type than the
chromosomal nucleic acid within the ACes. Also provided is a method of
making one or more cell lines, comprising

a) integrating into endogenous chromosomal DNA of a selected cell
species, a multiplicity of heterologous recombination sites,

b) introducing a multiplicity of ACes under conditions that promote
the site-specific chromosomal arm exchange of the ACes into, and out of,
a multiplicity of the heterologous recombination sites integrated within the
cell's endogenous chromosomal DNA;

c) isolating said multiplicity of ACes, thereby producing a library of
ACes whereby a multiplicity of ACes have mutually exclusive portions of
the endogenous chromosomal DNA therein;

d) introducing the isolated multiplicity of ACes of step c) into a
multiplicity of cells, thereby creating a library of cells;

e) selecting different cells having mutually exclusive ACes therein
and clonally expanding or differentiating said different cells into clonal cell
cultures, thereby creating one or more cell lines.

These ACes, cell lines and methods utilize the site-specific
recombination sites on platform ACes in a manner analogous to YAC
manipulation, including: the methods of generating terminal deletions in
normal and artificial chromosomes (e.g., ACes; as described in Vollrath
et al., 1988, PNAS, USA, 85:6027-6031; and Pavan et al., PNAS, USA,
87:1300-1304); the methods of generating interstitial deletions in normal
and artificial chromosomes (as described in Campbell et al., 1991, PNAS,
USA, 88:5744-5748); and the methods of detecting homologous
recombination between two ACes (as described in Cellini et al., 1991,
Nuc. Acid Res., 19(5):997-1000).

5. Use of platform ACes in pharmacogenomic/toxicology
applications (development of "Reporter ACes")

In addition to the placement of genes onto ACes chromosomes for
therapeutic protein production or gene therapy, the platform can be
engineered via the IntR lambda integrase to carry reporter-linked constructs
(reporter genes) that monitor changes in cellular physiology as measured by
the particular reporter gene (or a series of different reporter genes) readout.
The reporter-linked constructs are designed to include a gene that can be
detected (by, for example, fluorescence, drug resistance,
immunohistochemistry, or transcript production, and the like) with
well-known regulatory sequences that control the expression of the
detectable gene. Exemplary regulatory promoter sequences are well-known
in the art.

A) Reporter ACes for drug pathway screening 
The ACes can be engineered to carry reporter-linked constructs that
indicate a signal is being transduced through one or a number of pathways.
For example, transcriptionally regulated promoters from genes at the end (or
any other chosen point) of particular signal transduction pathways could be
engineered on the ACes to express the appropriate readout (either by
fluorescent protein production or drug resistance) when the pathway is
activated (or down-regulated as well). In one embodiment, a number of
reporters from different pathways can be placed on an
ACes chromosome. Cells (and/or whole animals) containing such a
Reporter ACes could be exposed to a variety of drugs or compounds and
monitored for the effects of the drugs or compounds upon the selected
pathway(s) by the reporter gene(s). Thus, drugs or compounds can be
classified or identified by particular pathways they excite or
down-regulate. Similarly, transcriptional profiles obtained from genomic array
experiments can be biologically validated using the reporter ACes
provided herein.
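
The classification of a compound by the pathways it excites or down-regulates can be sketched in Python as follows. This is a minimal illustration, assuming each reporter readout has already been expressed as a fold change relative to untreated cells; the two-fold threshold, the function name and the reporter names are hypothetical choices, and the example values are invented placeholders rather than measurements.

```python
from typing import Dict, List


def classify_by_pathway(fold_changes: Dict[str, float],
                        threshold: float = 2.0) -> Dict[str, List[str]]:
    """Group reporters into activated and down-regulated sets.

    fold_changes maps a reporter name to (treated readout / untreated readout).
    """
    if threshold <= 1.0:
        raise ValueError("threshold must be greater than 1")
    activated = sorted(name for name, fc in fold_changes.items()
                       if fc >= threshold)
    down_regulated = sorted(name for name, fc in fold_changes.items()
                            if 0 < fc <= 1.0 / threshold)
    return {"activated": activated, "down_regulated": down_regulated}


# Invented placeholder readouts for one hypothetical compound.
example = {
    "pathway_A_reporter": 5.0,
    "pathway_B_reporter": 1.1,
    "pathway_C_reporter": 0.25,
}
profile = classify_by_pathway(example)
print(profile)
```

Compounds whose profiles share the same activated and down-regulated reporters would then fall into the same class.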

B) Reporter ACes for toxic compound testing 

Environmental or man-made genotoxicants can be tested in cell
lines carrying platform ACes bearing a number of reporter genes linked to
promoters that are transcriptionally regulated in response to DNA damage,
induced apoptosis or necrosis, and cell-cycle perturbations. Furthermore,
new drugs and/or compounds could be tested in a similar manner with the
genotoxicant ACes reporter for their cellular/genetic toxicity by such a
screen. Likewise, toxic compound testing could be carried out in whole
transgenic animals carrying the ACes chromosome that measures
genotoxicant exposure ("canary in a coal mine"). Thus, the same or
similar type ACes could be used for toxicity testing in either a cell-based
or whole animal setting. An example would include ACes that carry
reporter-linked genes controlled by various cytochrome P450 profiled
promoters and the like.

C) Reporter ACes for individualized pharmacogenomics/drug profiling

A common disease may arise via various mechanisms. In many
instances there are multiple treatments available for a given disease.
However, the success of a given treatment may depend upon the
mechanism by which the disease originated and/or the genetic
background of the patient. In order to establish the most effective
treatment for a given patient one could utilize the ACes reporters provided
herein. ACes reporters can be used in patient cell samples to determine
an individualized drug regimen for the patient. In addition, potential
polymorphisms affecting the transcriptional regulation of an individual's
particular gene can be assessed by this approach.

D) Reporter ACes for classification of similar patient tumors 
As with other diseases as described in 5.C) above, cancer cells
arise via different mechanisms. Furthermore, as a cancerous cell
propagates it may undergo genomic alterations. An ACes reporter
transferred to cells of different patients having the same disease, i.e.,
similar cancers, could be used to categorize the particular cancer of each
patient, thereby facilitating the identification of the most effective
therapeutic regimen. Examples would include the validation of array
profiling of certain classes of breast cancers. Subsequently, appropriate
drug profiling could be carried out as described above.

E) Reporter ACes as a "differentiation" sensor

The ACes reporter can be used as a "differentiation" sensor in stem cells
or other progenitor cells in order to enrich by selection (FACS-based
screening, drug selection and/or use of a suicide gene) for a particular class
of differentiated or undifferentiated cells. For example, in one
embodiment, this assay could also be used for compound screening for
small molecule modifiers of cell differentiation.

F) Whole animal studies with Reporter ACes

Finally, with whole-body fluorescence imaging technology (Yang et
al. (2000) PNAS 97:12278), any of the above Reporter ACes methods
could be used in conjunction with whole-body imaging to monitor reporter
genes within whole animals without sacrificing the animal. This would
allow temporal and spatial analysis of expression patterns under a given
set of conditions. The conditions tested may include, for example, normal
differentiation of a stem cell, response to drug or compound treatment
whether targeted to the diseased tissue or presented systemically,
response to genotoxicants, and the like.

The following examples are included for illustrative purposes only
and are not intended to limit the scope of the invention.

EXAMPLE 1 

pFK161 

Cosmid pFK161 (SEQ ID NO: 118) was obtained from Dr. Gyula
Hadlaczky and contains a 9 kb NotI insert derived from a murine rDNA
repeat (see clone 161 described in PCT Application Publication No.
WO97/40183 by Hadlaczky et al. for a description of this cosmid). This
cosmid, referred to as clone 161, contains sequence corresponding to
nucleotides 10,232-15,000 in SEQ ID NO. 26. It was produced by
inserting fragments of the megachromosome (see U.S. Patent No.
6,077,697 and International PCT application No. WO 97/40183; for
example, H1D3, which was deposited at the European Collection of
Animal Cell Culture (ECACC) under Accession No. 96040929, is a
mouse-hamster hybrid cell line carrying this megachromosome) into
plasmid pWE15 (Stratagene, La Jolla, California; SEQ ID No. 31) as
follows. Half of a 100 µl low melting point agarose block (mega-plug)
containing isolated SATACs was digested with NotI overnight at 37°C.
Plasmid pWE15 was similarly digested with NotI overnight. The
mega-plug was then melted and mixed with the digested plasmid, ligation
buffer and T4 DNA ligase. Ligation was conducted at 16°C overnight.
Bacterial DH5α cells were transformed with the ligation product and
transformed cells were plated onto LB/Amp plates. Fifteen to twenty
colonies were grown on each plate for a total of 189 colonies. Plasmid DNA
was isolated from colonies that survived growth on LB/Amp medium and
analyzed by Southern blot hybridization for the presence of DNA that
hybridized to a pUC19 probe. This screening methodology assured that
all clones, even clones lacking an insert but still containing the pWE15
plasmid, would be detected.

Liquid cultures of all 189 transformants were used to generate
cosmid minipreps for analysis of restriction sites within the insert DNA.
Six of the original 189 cosmid clones contained an insert. These clones
were designated as follows: 28 (~9-kb insert), 30 (~9-kb insert), 60
(~4-kb insert), 113 (~9-kb insert), 157 (~9-kb insert) and 161 (~9-kb
insert). Restriction enzyme analysis indicated that three of the clones
(113, 157 and 161) contained the same insert.

For sequence analysis the insert of cosmid clone no. 161 was
subcloned as follows. To obtain the end fragments of the insert of clone
no. 161, the clone was digested with NotI and BamHI and ligated with
NotI/BamHI-digested pBluescript KS (Stratagene, La Jolla, California).
Two fragments of the insert of clone no. 161 were obtained: a 0.2-kb and
a 0.7-kb insert fragment. To subclone the internal fragments of the insert
of clone no. 161, the same digest was ligated with BamHI-digested
pUC19. Three fragments of the insert of clone no. 161 were obtained: a
0.6-kb, a 1.8-kb and a 4.8-kb insert fragment.

The insert corresponds to an internal section of the mouse
ribosomal RNA gene (rDNA) repeat unit between positions 7551-15670
as set forth in GENBANK accession no. X82564, which is provided as
SEQ ID NO. 18. The sequence data obtained for the insert of clone no.
161 is set forth in SEQ ID NOS. 19-25. Specifically, the individual
subclones corresponded to the following positions in GENBANK accession
no. X82564 (SEQ ID NO: 18) and in SEQ ID NOs. 19-25:

Subclone   Start in   End in    Site          SEQ ID No.
           X82564     X82564
161k1      7579       7755      NotI, BamHI   19
161m5      7756       8494      BamHI         20
161m7      8495       10231     BamHI         21 (shows only sequence corresponding
                                              to nt. 8495-8950),
                                              22 (shows only sequence corresponding
                                              to nt. 9851-10231)
161m12     10232      15000     BamHI         23 (shows only sequence corresponding
                                              to nt. 10232-10600),
                                              24 (shows only sequence corresponding
                                              to nt. 14267-15000)
161k2      15001      15676     NotI, BamHI   25


The sequence set forth in SEQ ID NOs. 19-25 diverges in some
positions from the sequence presented in positions 7551-15670 of
GENBANK accession no. X82564. Such divergence may be attributable
to random mutations between repeat units of rDNA.
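
The subclone coordinates listed in the table above can be checked with a short Python sketch that computes each fragment length and confirms that the subclones tile the insert without gaps or overlaps. It assumes that the start and end positions are inclusive and that the subclone names are as read from the table; the function names are illustrative.

```python
from typing import Dict, List, Tuple

# (subclone, start, end) positions in GENBANK accession no. X82564,
# copied from the table; coordinates are assumed to be inclusive.
SUBCLONES: List[Tuple[str, int, int]] = [
    ("161k1", 7579, 7755),
    ("161m5", 7756, 8494),
    ("161m7", 8495, 10231),
    ("161m12", 10232, 15000),
    ("161k2", 15001, 15676),
]


def fragment_lengths(subclones: List[Tuple[str, int, int]]) -> Dict[str, int]:
    """Length in base pairs of each subclone, counting both end positions."""
    return {name: end - start + 1 for name, start, end in subclones}


def is_contiguous(subclones: List[Tuple[str, int, int]]) -> bool:
    """True when each subclone starts one base after the previous one ends."""
    return all(nxt[1] == prev[2] + 1
               for prev, nxt in zip(subclones, subclones[1:]))


lengths = fragment_lengths(SUBCLONES)
total = sum(lengths.values())
for name, length in lengths.items():
    print(f"{name}: {length} bp")
print(f"contiguous: {is_contiguous(SUBCLONES)}, total: {total} bp")
```

Under these assumptions the five subclones are contiguous and together span 8098 bp, from position 7579 to position 15676.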

For use herein, the rDNA insert from the clone was prepared by
digesting the cosmid with NotI and BglII and was purified as described
above. Growth and maintenance of bacterial stocks and purification of
plasmids were performed using standard well known methods (see, e.g.,
Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd
Edition, Cold Spring Harbor Laboratory Press), and plasmids were purified
from bacterial cultures using Midi- and Maxi-prep Kits (Qiagen,
Mississauga, Ontario).
pDsRed1N1 

This vector is available from Clontech (see SEQ ID No. 29) and
encodes the red fluorescent protein (DsRed; Genbank accession no.
AF272711; SEQ ID Nos. 39 and 40). DsRed, which has a vivid red
fluorescence, was isolated from the IndoPacific sea anemone relative
Discosoma species. The plasmid pDsRed1N1 (Clontech; SEQ ID No. 29)
constitutively expresses a human codon-optimized variant of the
fluorescent protein under control of the CMV promoter. Unmodified, this
vector expresses high levels of DsRed1 and includes sites for creating
N-terminal fusions by cloning proteins of interest into the multiple cloning
site (MCS). It carries kanamycin (Kan) and neomycin (Neo) resistance for
selection in bacterial or eukaryotic cells.
Plasmid pMG 

Plasmid pMG (InvivoGen, San Diego, California; see SEQ. ID. NO.
27 for the nucleotide sequence of pMG) contains the hygromycin
phosphotransferase gene under the control of the immediate-early human
cytomegalovirus (hCMV) enhancer/promoter with intron A. Vector pMG
also contains two transcriptional units allowing for the coexpression of
two heterologous genes from a single vector sequence.

The first transcriptional unit of pMG contains a multiple cloning site
for insertion of a gene of interest, the hygromycin phosphotransferase
gene (hph) and the immediate-early human cytomegalovirus (hCMV)
enhancer/promoter with intron A (see, e.g., Chapman et al. (1991) Nuc.
Acids Res. 19:3979-3986) located upstream of hph and the multiple
cloning site, which drives the expression of hph and any gene of interest
inserted into the multiple cloning site as a polycistronic mRNA. The first
transcriptional unit also contains a modified EMCV internal ribosomal
entry site (IRES) upstream of the hph gene but downstream of the hCMV
promoter and MCS for ribosomal entry in translation of the hph gene (see
SEQ ID NO. 27, nucleotides 2736-3308). The IRES is modified by
insertion of the constitutive E. coli promoter (EM7) within an intron (IM7)
into the end of the IRES. In mammalian cells, the E. coli promoter is
treated as an intron and is spliced out of the transcript. A
polyadenylation signal from the bovine growth hormone (bGh) gene (see,
e.g., Goodwin and Rottman (1992) J. Biol. Chem. 267:16330-16334)
and a pause site derived from the 3' flanking region of the human α2
globin gene (see, e.g., Enriquez-Harris et al. (1991) EMBO J.
10:1833-1842) are located at the end of the first transcription unit.
Efficient polyadenylation is facilitated by inserting the flanking sequence
of the bGh gene 3' to the standard AAUAAA hexanucleotide sequence.

The second transcriptional unit of pMG contains another multiple
cloning site for insertion of a gene of interest and an EF-1α/HTLV hybrid
promoter located upstream of this multiple cloning site, which drives the
expression of any gene of interest inserted into the multiple cloning site.
The hybrid promoter is a modified human elongation factor-1 alpha (EF-1
alpha) gene promoter (see, e.g., Kim et al. (1990) Gene 91:217-223)
that includes the R segment and part of the U5 sequence (R-U5') of the
human T-cell leukemia virus (HTLV) type 1 long terminal repeat (see, e.g.,
Takebe et al. (1988) Mol. Cell. Biol. 8:466-472). The Simian Virus 40
(SV40) late polyadenylation signal (see Carswell and Alwine (1989) Mol.
Cell. Biol. 9:4248-4258) is located downstream of the multiple cloning
site. Vector pMG contains a synthetic polyadenylation site for the first
and second transcriptional units at the end of the transcriptional unit
based on the rabbit β-globin gene and containing the AATAAA
hexanucleotide sequence and a GT/T-rich sequence with 22-23
nucleotides between them (see, e.g., Levitt et al. (1989) Genes Dev.
3:1019-1025). A pause site derived from the C2 complement gene (see
Moreira et al. (1995) EMBO J. 14:3809-3819) is also located at the 3'
end of the second transcriptional unit.

Vector pMG also contains an ori sequence (ori pMB1) located
between the SV40 polyadenylation signal and the synthetic
polyadenylation site.

EXAMPLE 2 

A. Construction of targeting vector and transfection into LMtk- cells
for the generation of platform chromosomes

A targeting vector derived from the vector pWE15 (GenBank
Accession # X65279) was modified by replacing the SalI (Klenow
filled)/SmaI neomycin resistance containing fragment with the
PvuII/BamHI (Klenow filled) puromycin resistance containing fragment
(isolated from plasmid pPUR, Clontech Laboratories, Inc. Palo Alto, CA;
SEQ ID No. 30) resulting in plasmid pWEPuro. Subsequently a 9 Kb NotI
fragment from the plasmid pFK161 (SEQ ID NO: 118) containing a portion
of the mouse rDNA region was cloned into the NotI site of pWEPuro
resulting in plasmid pWEPuro9K (Figure 2). The vector pWEPuro9K was
digested with SpeI to linearize it and transfected into LMtk- mouse cells.
Puromycin resistant colonies were isolated and subsequently tested for
artificial chromosome formation via fluorescent in situ hybridization (FISH)
(using mouse major and minor DNA repeat sequences, the puromycin
gene and telomere sequences as probes), and fluorescent activated cell
sorting (FACS). From this sort, a subclone was isolated containing an
artificial chromosome, designated 5B11.12, which carries 4-8 copies of
the puromycin resistance gene contained on the pWEPuro9K vector.
FISH analysis of the 5B11.12 subclone demonstrated the presence of
telomeres and mouse minor repeat DNA on the ACes. DOT PCR has been
done on the 5B11.12 ACes revealing the absence of uncharacterized
euchromatic regions on the ACes. A recombination site, such as an att or
loxP engineering site or a plurality thereof, was introduced onto this ACes
thereby providing a platform for site-specific introduction of heterologous
nucleic acid.

B. Targeting a single sequence specific recombination site onto
platform chromosomes

After the generation of the 5B11.12 platform, a single
sequence-specific recombination site is placed onto the platform
chromosome via homologous recombination. For this, DNA sequences
containing the site-specific recombination sequence can be flanked with
DNA sequences of homology to the platform chromosome. For example,
using the platform chromosome made from the pWEPuro9K vector, mouse
rDNA sequences or mouse major satellite DNA can be used as homologous
sequences to target onto the platform chromosome. A vector is designed to
have these homologous sequences flanking the site-specific recombination
site and, after the appropriate restriction enzyme digest to generate free
ends of homology to the platform chromosome, the DNA is transfected into
cells harboring the platform chromosome (Figure 3). Examples of
site-specific cassettes that are targeted to the platform chromosome using
either mouse rDNA or mouse major repeat DNA include the SV40-attP-hygro
cassette and a red fluorescent protein (RFP) gene flanked by loxP sites
(Cre/lox, see, e.g., U.S. Patent No. 4,959,317 and description herein).
After transfection and integration of the site-specific cassette,
homologous recombination events onto the platform chromosome are
subcloned and identified by FACS (e.g., screen and single cell subclone via
expression of resistance or fluorescent marker) and PCR analysis.

For example, a vector can be constructed containing regions of the
mouse rDNA locus flanking a gene cassette containing the SV40 early
promoter-bacteriophage lambda attP site-hygromycin selectable marker
(see Figure 4 and described below). The use of the bacteriophage lambda
attP site for lambda integrase-mediated site-specific recombination is
described below. A homologous recombination event of the SV40-attP-
hygro cassette onto the platform chromosome was identified using PCR
primers that detect the homologous recombination and further confirmed
by FISH analysis. Once subcloned colonies containing the platform
chromosome with a single site-specific recombination site have been
identified, cells carrying that platform chromosome can be engineered
with site-specific recombinases (e.g., lambda INT, Cre) for integrating a
target gene expression vector.

C. Targeting a red fluorescent protein (RFP) gene flanked by loxP sites
onto 5B11.12 platform

As another example, while loxP recombination sites could have
been introduced onto the ACes during de novo biosynthesis, it was
thought that this might result in multiple segments of the ACes containing
a high number of loxP sites, potentially leading to instability upon
Cre-mediated recombination. A gene targeting approach was therefore
devised to introduce a more limited number of loxP recombination sites
into a locus of the 5B11-12 ACes containing introduced and possibly
co-amplified endogenous rDNA sequences. Although there are more than
200 copies of rDNA genes in the haploid mouse genome distributed
amongst 5-11 chromosomes (depending on strain), rDNA sequences were
chosen as the target on the ACes since they represent a less frequent
target than that of the satellite repeat sequences. Moreover, having
observed much stronger pWEPuro9K hybridization to the 5B11-12 ACes
than to other LMTK- chromosomes and in light of the observation that the
transcribed spacer sequences within the rDNA may be less conserved
than the rRNA coding regions, it was contemplated that a targeting vector
based on the rDNA gene segment in pWEPuro9K would have a higher
probability of targeting to the ACes rather than to other LMTK-
chromosomes. Accordingly, a targeting vector, pBSFKLoxDsRedLox, was
designed and constructed based on the rDNA sequences contained in
pWEPuro9K.

The plasmid pBSFKLoxDsRedLox was generated in 4 steps. First,
the NotI rDNA insert of pWEPuro9K (Figure 2) was inserted into pBS SK-
(Stratagene) giving rise to pBSFK. Second, a loxP polylinker cassette was
generated by PCR amplification of pNEB193 (SEQ ID NO:32; New
England Biolabs) using primers complementary to the M13 forward and
reverse priming sites at their 3' end and a 34 bp 5' extension comprising a
loxP site. This cassette was reinserted into pNEB193 generating
p193LoxMCSLox. Third, the DsRed gene from pDsRed1-N1 (SEQ ID
NO:29; Clontech) was cloned into the polylinker between the loxP
sites generating p193LoxDsRedLox. Fourth, a fragment consisting of the
DsRed gene flanked by loxP sites was cloned into a unique NdeI site within
the rDNA insert of pBSFK generating pBSFKLoxDsRedLox.

A gel purified 11 Kb PmlI/EcoRV fragment of pBSFKLoxDsRedLox
was used for transfection. To detect targeted integration, PCR primers
were designed from rDNA sequences within the 5' NotI-PmlI fragment of
pWEPuro9K that is not present on the targeting fragment (5' primer) and
sequence within the LoxDsRedLox cassette (3' primer). If the targeting
DNA integrated correctly within the rDNA sequences, PCR amplification
using these primers would give rise to a 2.3 Kb band. PCR reactions
containing 1-4 µl of genomic DNA were carried out according to the
MasterTaq protocol (Eppendorf), using murine rDNA 5' primer (5'-
CGGACAATGCGGTTGTGCGT-3'; SEQ ID NO:72) and DsRed 3' primer
(5'-GGCCCCGTAATGCAGAAGAA-3'; SEQ ID NO:73), and PCR products
were analyzed by agarose gel electrophoresis.
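
As a rough plausibility check on the screening primer pair, the sketch
below computes the length, G+C count and a Wallace-rule melting
temperature estimate for the two primers exactly as printed above. The
Wallace rule (2 degrees C per A/T, 4 degrees C per G/C) and the function
names are illustrative assumptions; they are not part of the described
protocol, and the MasterTaq protocol may call for a different annealing
calculation.

```python
# Primer sequences as printed in the text (SEQ ID NO:72 and SEQ ID NO:73).
RDNA_5P = "CGGACAATGCGGTTGTGCGT"
DSRED_3P = "GGCCCCGTAATGCAGAAGAA"


def gc_count(seq: str) -> int:
    """Return the number of G and C bases in a DNA sequence."""
    return sum(1 for base in seq.upper() if base in "GC")


def wallace_tm(seq: str) -> int:
    """Wallace-rule Tm estimate in degrees C: 2*(A+T) + 4*(G+C).

    Assumption: a rule of thumb for short oligos containing only
    A, C, G and T; not the method used in the source.
    """
    gc = gc_count(seq)
    at = len(seq) - gc
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for name, seq in (("rDNA 5' primer", RDNA_5P),
                      ("DsRed 3' primer", DSRED_3P)):
        print(name, len(seq), gc_count(seq), wallace_tm(seq))
```

Under these assumptions the two estimates differ by only 2 degrees C,
which is consistent with the primers being usable as a pair.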

1.5 x 10^6 5B11-12 LMTK- cells were transfected with 2 µg of the
pBSFKLoxDsRedLox targeting DNA described above using Lipofectamine
Plus (Invitrogen). For flow sorting, harvested cells were suspended in
medium and applied to the Becton Dickinson Vantage SE cell sorter,
equipped with 488 nm lasers for excitation and a 585/42 bandpass filter for
optimum detection of RFP fluorescence. Cells were sorted using dPBS as
sheath buffer. Negative control parental 5B11-12 cells and a positive
control LMTK- cell line stably transfected with DsRed were used to
establish the selection gates. The RFP positive gated populations were
recovered, diluted in medium supplemented with 1X penicillin-
streptomycin (Invitrogen), then plated and cultured as previously
described. After 4 rounds of enrichment, the percentage of RFP positive
cells reached levels of 50% or higher. DNA from populations was
analyzed by PCR for evidence of targeted integration. Ultimately, single
cell subclones were established from positive pools and were analyzed by
PCR, and PCR-positive clones were confirmed by FISH as described below.
DNA was purified from pools or single cell clones using previously
described methods set forth in Lahm et al., Transgenic Res., 1998;
7:131-134, or in some cases using a Wizard Genomic DNA purification kit
(Promega). For FISH analysis, a biotinylated DsRed gene probe was
generated by PCR using DsRed specific primers and biotin-labeled dUTP
(5' RFP primer: 5'-GGTTTAAAGTGCGCTCCTCCAAGAACGTCATC-3',
SEQ ID NO:74; and 3' RFP primer:
5'-AGATCTAGAGCCGCCGCTACAGGAACAGGTGGTGGCGGCC-3'; SEQ
ID NO:75). To maximize the signal intensity of the DsRed probe,
tyramide amplification was carried out according to the manufacturer's
protocols (NEN).

The process of testing the feasibility of a more general targeting
strategy that would not rely on enrichment via drug selection of stably
transfected clones can be summarized as follows. A red fluorescent
protein gene (RFP; encoded by the DsRed gene) was inserted between the
loxP sites of the targeting vector to form pBSFKLoxDsRedLox. After
transfection with pBSFKLoxDsRedLox, sequential rounds of high speed
flow sorting and expansion of sorted cells in culture could then be used to
enrich for stable transformants expressing RFP. In the event of targeted
integration, PCR screening with primers that amplify from a spacer region
within the segment of the 45S pre-rRNA gene in pWEPuro9K to a specific
anchor sequence within the DsRed gene in the targeting cassette would
give rise to a diagnostic 2.3 Kb band. However, as rDNA clusters are
found on several chromosomes, confirmation of targeting to an ACes
would require fluorescence in situ hybridization (FISH) analysis. Finally,
the flanking of the DsRed gene by loxP sites would allow for its removal
and subsequent replacement with other genes of interest.

After transfection of the targeting sequence into 5B11-12 cells,
enrichment for targeted clones was carried out using a combination of
flow cytometry to detect red-fluorescing cells and PCR screening.
Ultimately 17 single cell subclones were identified as potential targeted
clones by PCR and of these 16 were found by FISH to contain the DsRed
integration event into the ACes. These subclones are referred to herein
as D11-C4, D11-G12, D11-H3, C9-G9, C9-B9, G9-F4, C9-H8, C9-F2, G9-
G8, G9-B6, G9-G3, G9-E12, C9-A11, G11-E3, G11-A9 and C11-H4. PCR
analysis of genomic DNA isolated from the D11-C4 subclone gave rise to
a 2.3 Kb band, indicative of a targeted integration into an rDNA locus.
Further analysis of the subclone by FISH with a DsRed gene
probe demonstrated integration of the LoxDsRedLox targeting cassette on
the ACes co-localizing with one of the regions of rDNA staining seen on
the 5B11-12 ACes, consistent with a targeted integration into an rDNA
locus of the ACes, while integrations on other chromosomes were not
observed. Since transfected cells were maintained as heterogeneous
populations through several cycles of sorting and replating, it was not
possible to estimate the frequency of targeted events. In most
mammalian cell lines the frequency of gene targeting via homologous
recombination is roughly 10'^-10"^ treated cells. Despite the low 
frequency of these events in mammalian cells, it is clear that an RFP
expression based screening paradigm, coupled with PCR analysis, can
effectively detect and enrich for such infrequent events in a large
population. In instances where drug selection is not possible or not
desirable, such a system may provide a useful alternative. It was also
verified that the modified ACes in subclone D11-C4 could be purified by
flow cytometry. The results indicate that the flow karyogram of the
D11-C4 subclone was unaltered from that of the 5B11-12 cell line. Thus, the
D11-C4 ACes can be purified in high yield from native chromosomes of
the host cell line.

D. Reduction of LoxP on ACes to a single site. 

The strong hybridization signal detected by FISH on the ACes using
the DsRed gene probe suggests that several copies of the targeting
cassette may be present on the ACes in the D11-C4 line. This also
suggests that multiple rDNA genes have been correctly targeted.

Accordingly, in certain embodiments where necessary, the number
of loxP sites on the ACes can be reduced to a single site by in situ
treatment with Cre recombinase, provided that the sites are co-linear.
Such a process is described for multiple loxP-flanked integrations on a
native mouse chromosome (Garrick et al., Nature Genet., 1998,
Jan;18(1):56-59). Reduction to a single loxP site on the D11-C4 ACes
would result in the loss of the DsRed gene, forming the basis of a useful
screen for this event.

For this purpose, a Cre expression plasmid pCX-Cre/GFP III has
been generated by first deleting the EcoRI fragment of pCX-eGFP (SEQ ID
NO:71) containing the eGFP coding sequence and replacing it with
a PCR amplified Cre recombinase coding sequence (SEQ ID NO:58),
generating pCX-Cre. Next, the AseI/SspI fragment of pD2eGFP-N1
(containing the CMV promoter driving the D2EGFP gene with SV40 polyA
signal; Clontech; SEQ ID NO:87) was inserted into the filled HindIII site of
pCX-Cre, generating pCX-Cre/GFP III. Control plasmid pCX-CreRev/GFP
III was generated in similar fashion except that the Cre recombinase
coding sequence was inserted in the antisense orientation. LMTK- cell
line D11-C4 (containing first generation platform ACes with multiple loxP-
DsRed sites) and the 5B11-12 cell line (containing ACes with no loxP-DsRed
sites) are maintained in culture as described above. D11-C4 cells are
transfected with 2 µg of plasmid pCX-Cre/GFP III or 2 µg of pCX-
CreRev/GFP III using Lipofectamine (Invitrogen) as previously described.

Forty-eight to seventy-two hours after transfection, transfected
D11-C4 cells are harvested and GFP positive cells are sorted by cell
cytometry using a FACS Vantage cell sorter (Becton Dickinson) as
follows: All D11-C4 cells transfected with pCX-Cre/GFP III or control
plasmid pCX-CreRev/GFP III that exhibit GFP fluorescence higher than the
gate level established by untransfected cells are collected and placed in
culture a further 7-14 days. After 7-14 days the initial D11-C4 cells are
harvested and analyzed by cell cytometry as follows: Untransfected
D11-C4 cells are used to establish the gate that defines the RFP positive
population, while 5B11-12 cells are used to set the RFP negative gate.
The GFP positive population of D11-C4 transfected with pCX-Cre/GFP III
should show decreased red fluorescence compared to pCX-CreRev/GFP III
transfected or untransfected control D11-C4 cells. The cells exhibiting
greatly decreased or no RFP expression are collected and single cell
clones subsequently established. These clones will be expanded and
analyzed by fluorescence in situ hybridization and Southern blotting to
confirm the removal of loxP-DsRed gene copies.


EXAMPLE 3 

Construction of targeting vector and transfection into LMtk- cells for the
generation of platform chromosomes containing multiple site-specific
recombination sites




An example of a selectable marker system for the creation of a
chromosome-based platform is shown in Figure 4. This system includes a
vector containing the SV40 early promoter immediately followed by (1) a
282 base pair (bp) sequence containing the bacteriophage lambda attP
site and (2) the puromycin resistance marker. Initially a PvuII/StuI
fragment containing the SV40 early promoter from plasmid pPUR
(Clontech Laboratories, Inc., Palo Alto, CA; SEQ ID No. 30) was
subcloned into the EcoICRI site of pNEB193 (a pUC19 derivative
obtained from New England Biolabs, Beverly, MA; SEQ ID No. 32)
generating the plasmid pSV40193. The only differences between pUC19
and pNEB193 are in the polylinker region. A unique AscI site
(GGCGCGCC) is located between the BamHI site and the SmaI site, a
unique PacI site (TTAATTAA) is located between the BamHI site and the
XbaI site and a unique PmeI site (GTTTAAAC) is located between the PstI
site and the SalI site.

The attP site was PCR amplified from lambda genome (GenBank
Accession # NC_001416) using the following primers:

attPUP:  CCTTGCGCTAATGCTCTGTTACAGG   SEQ ID No. 1
attPDWN: CAGAGGCAGGGAGTGGGACAAAATTG  SEQ ID No. 2

After amplification and purification of the resulting fragment, the
attP site was cloned into the SmaI site of pSV40193 and the orientation
of the attP site was determined by DNA sequence analysis (plasmid
pSV40193attP). The gene encoding puromycin resistance (Puro) was
isolated by digesting the plasmid pPUR (Clontech Laboratories, Inc., Palo
Alto, CA) with AgeI/BamHI followed by filling in the overhangs with
Klenow and subsequently cloned into the AscI site downstream of the
attP site of pSV40193attP generating the plasmid
pSV40193attPsensePUR (Figure 4; SEQ ID NO:113).

The plasmid pSV40193attPsensePUR was digested with ScaI and
co-transfected with the plasmid pFK161 (SEQ ID NO: 118) into mouse
LMtk- cells and platform artificial chromosomes were identified and
isolated as described above. The process for generating this exemplary
platform ACes containing multiple site-specific recombination sites is
summarized in Figure 5. One platform ACes resulting from this
experiment is designated B19-18. This platform ACes chromosome may
subsequently be engineered to contain target gene expression nucleic
acids using the lambda integrase mediated site-specific recombination
system as described herein in Examples 7 and 8.

EXAMPLE 4 

Lambda integrase mediated site-specific recombination of a RFP 
expressing vector onto artificial chromosomes 

In this example, a vector expressing the red fluorescent protein
(RFP) was produced and recombined into the attP site residing on an
artificial chromosome within LMTK- cells. This recombination is depicted
in Figure 7.

A. Construction of expression vectors containing wildtype and
mutant lambda integrase

Mutations at the glutamic acid at position 174 in the lambda
integrase protein relax the requirement for the accessory protein IHF
during recombination and DNA supercoiling in vitro (see, Miller et al.
(1980) Cell 20:721-729; Lange-Gustafson et al. (1984) J. Biol. Chem.
259:12724-12732). Mutations at this site promote attP, attB
intramolecular recombination in mammalian cells (Lorbach et al. (2000) J.
Mol. Biol. 296:1175-1181).

To construct nucleic acid encoding the mutant, lambda integrase
was PCR amplified from bacteriophage lambda DNA (cI857 ind Sam7;
New England Biolabs) using the following primers:

Lamint1 (SEQ ID No. 3)
(TTCGAATTCATGGGAAGAAGGCGAAGTCATGAGCG)
Lamint2 (SEQ ID No. 4)
(TTCGAATTCTTATTTGATTTCAATTTTGTCCCAC).

The resulting PCR product was digested with EcoRI and cloned into the
EcoRI site of pUC19. Lambda integrase was mutated at amino acid
position 174 using the QuikChange Site-Directed Mutagenesis Kit
(Stratagene) and the following oligos (generating a glutamic acid to
arginine change at position 174):

LambdaINTE174R (SEQ ID No. 6)
(CGCGCAGCAAAATCTAGAGTAAGGAGATCAAGACTTACGGCTGACG),
LamintR174rev (SEQ ID No. 7)
(CGTCAGCCGTAAGTCTTGATCTCCTTACTCTAGATTTTGCTGCGCG).

The resulting site directed mutant was confirmed by sequence analysis.
The wildtype and mutant lambda integrase genes were cloned into the
EcoRI site of pCX creating pCX-LamInt (SEQ ID NO: 127) and pCXLamIntR
(Figure 8; SEQ ID NO: 112).

The plasmid pCX (SEQ ID No. 70) was derived from plasmid
pCXeGFP (SEQ ID No. 71). Excision of the EcoRI fragment containing the
eGFP marker generated pCX. To generate plasmid pCXLamIntR (SEQ ID
NO: 112) an EcoRI fragment containing the lambda integrase E174R (SEQ
ID No. 37) mutation was cloned into the EcoRI site of pCX, and to
generate plasmid pCX-LamInt, an EcoRI fragment containing the wild-
type lambda integrase was cloned into the EcoRI site of pCX.

B. Construction of integration vector containing attB and DsRed

The plasmid pDsRedN1 (Clontech Laboratories, Palo Alto, CA; SEQ
ID No. 29) was digested with HpaI and ligated to the following annealed
oligos:

attB1 (SEQ ID No. 8)
(TGAAGCCTGCTTTTTTATACTAACTTGAGCGAA)
attB2 (SEQ ID No. 9)
(TTCGCTCAAGTTAGTATAAAAAAGCAGGCTTCA)

The resulting vector (pDsRedN1-attB) was confirmed by PCR and
sequence analysis.
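
Because HpaI leaves blunt ends, the two attB oligos need to anneal into
a fully base-paired, blunt duplex. A minimal sketch of that check,
using the oligo sequences exactly as printed above, is shown here; the
helper names are hypothetical and the check is an illustration, not a
step described in the source.

```python
# Oligo sequences as printed in the text (SEQ ID No. 8 and SEQ ID No. 9).
ATTB1 = "TGAAGCCTGCTTTTTTATACTAACTTGAGCGAA"
ATTB2 = "TTCGCTCAAGTTAGTATAAAAAAGCAGGCTTCA"

# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def anneals_blunt(top: str, bottom: str) -> bool:
    """True if the two strands pair over their full length with no overhang."""
    return reverse_complement(top) == bottom.upper()


if __name__ == "__main__":
    print(anneals_blunt(ATTB1, ATTB2))
```

The printed sequences pass this check: attB2 is the exact reverse
complement of attB1, so the annealed pair forms a 33 bp blunt duplex.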

C. Transfection into LMtk- cells 

LM(tk-) cells containing the Prototype A ACes (L1-18; Chromos
Molecular Systems Inc., Burnaby, BC Canada) were co-transfected with
pDsRedN1 or pDsRedN1-attB and either pCXLamInt (SEQ ID NO: 127) or
pCXLamIntR (SEQ ID NO: 112) using Lipofectamine Plus Reagent
(LifeTechnologies, Gaithersburg, MD). The transfected cells were grown
in DMEM (LifeTechnologies, Gaithersburg, MD) with 10% FBS (CanSera)
and G418 (CalBiochem) at a concentration of 1 mg/ml.

D. Enrichment by cell sorting 

The transfected cells were sorted using a FACS Vantage SE cell
sorter (Becton Dickinson) to enrich for cells expressing DsRed. The cells
were excited with a 488 nm argon laser at 200 mW and cells
fluorescing in the 585/42 detection channel were collected. The sorted
cells were returned to growth medium for recovery and expansion. After
three successive enrichments for cells expressing DsRed, single cell
sorting into 96 well plates was performed using the same parameters.
Duplicate plates of the single cell clones were made for PCR analysis.

E. PCR analysis of single cell clones 

Pools of cells from each row and column of the 96 well plate were
used for DNA isolation. DNA was prepared using a Wizard Genomic DNA
purification kit (Promega Inc, Madison, WI). Nested PCR analysis on the
DNA pools was performed to confirm the site-specific recombination
event using the following primer sets:

attPdwn2 (SEQ ID No. 10)
(TCTTCTCGGGCATAAGTCGGACACC)

CMVen (SEQ ID No. 11)
(CTCACGGGGATTTCCAAGTCTCCAC)

followed by:

attPdwn (SEQ ID No. 12)
(CAGAGGCAGGGAGTGGGACAAAATTG)

CMVen2 (SEQ ID No. 13)
(CAACTCCGCCCCATTGACGCAAATG).

The resulting PCR reactions were analyzed by gel electrophoresis and the
potential individual clones containing the site-specific recombination event
were identified by combining the PCR results of all of the pooled rows
and columns for each 96 well plate. The individual clones were then
further analyzed by PCR using the following primers that flank the
recombination junction. L1for and F1rev flank the attR junction whereas
REDfor and L2rev flank the attL junction (see Figure 7):

L1for (SEQ ID No. 14)
AGTATCGGGGAAGGATTAGGTGTTGA

F1rev (SEQ ID No. 15)
GCGGATTTGQGCCTATTGGTTAAA

REDfor (SEQ ID No. 16)
CGGGGGACATCCGGGAGTAGAAGAA

L2rev (SEQ ID No. 17)
TTGGTTCGAAGGGGATGCGGGTAGC.
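
The row and column pooling scheme described above can be sketched as
follows: each pool is scored PCR-positive or PCR-negative, and any well
lying at the intersection of a positive row pool and a positive column
pool is a candidate clone. The well naming (rows A-H, columns 1-12) and
the function name are assumptions made for illustration; the source
does not specify how the pooled results were tabulated.

```python
from itertools import product

ROWS = "ABCDEFGH"            # 8 rows of a 96 well plate (assumed labels)
COLUMNS = range(1, 13)       # 12 columns (assumed labels)


def candidate_wells(positive_rows, positive_columns):
    """Return wells at the intersection of positive row and column pools.

    positive_rows: iterable of row letters whose pooled DNA was PCR-positive.
    positive_columns: iterable of column numbers whose pool was PCR-positive.
    """
    row_hits = set(positive_rows)
    col_hits = set(positive_columns)
    rows = [r for r in ROWS if r in row_hits]
    cols = [c for c in COLUMNS if c in col_hits]
    return [f"{r}{c}" for r, c in product(rows, cols)]


if __name__ == "__main__":
    # Two positive rows and two positive columns give four candidates,
    # of which as few as two may be true positives.
    print(candidate_wells(["B", "F"], [3, 10]))
```

When more than one row and more than one column are positive, the
intersections are ambiguous, which is why the candidates are described
as potential clones and are re-tested individually with the
junction-flanking primers.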

F. Sequence analysis of recombination junctions

PCR products spanning the recombination junction were Topo-
cloned into pcDNA3.1D/V5-His (Invitrogen Inc., San Diego, CA) and then
sequenced by cycle-sequencing. The clones were confirmed to have the
correct attR and attL junctions by cycle sequencing.

G. Fluorescent In Situ Hybridization (FISH)

The cell lines containing the correct recombination junction
sequence were further analyzed by fluorescent in situ hybridization (FISH)
by probing with the DsRed coding region labeled with biotin and
visualizing with the Tyramide Signal Amplification system (TSA; NEN Life
Science Products). The results indicate that the RFP sequence is present
on the ACes.

H. Southern analysis

Genomic DNA was harvested from the cell lines containing an
ACes with the correct recombinant event and digested with EcoRI. The
digested DNAs were separated on a 0.7% agarose gel, transferred and
fixed to a nylon membrane and probed with RFP coding sequences. The
result showed that there is an integrated copy of RFP coding sequence in
each clone.

EXAMPLE 5 

Delivery of a second gene encoding GFP onto the RFP platform ACes 

A. Construction of integration vector containing attB and GFP
(pD2eGFPIresPuroattB).

The plasmid pIRESpuro2 (Clontech, Palo Alto, CA; SEQ ID NO: 88)
was digested with EcoRI and NotI then ligated to the D2eGFP EcoRI-NotI
fragment from pD2eGFP-N1 (Clontech, Palo Alto, CA) to create
pD2eGFPIresPuro2. Subsequently, oligos encoding the attB site were
annealed and ligated into the NruI site of pD2eGFPIresPuro2 to create
pD2eGFPIresPuroattB. The orientation of attB in the NruI site was
determined by PCR.

B. Transfection of LMtk- cells 

The LMtk- cells containing the RFP platform ACes produced in
Example 4, which has multiple attP sites, were co-transfected with
pCXLamIntR and pD2eGFPIresPuroattB using LipofectAMINE PLUS
reagent. Five µg of each vector was placed into a tube containing 750 µl
of DMEM (Dulbecco's modified Eagle's Medium). Twenty µl of the Plus
reagent was added to the DNA and incubated at room temperature for 15
minutes. A mixture of 30 µl of LipofectAMINE and 750 µl DMEM was
added to the DNA mixture and incubated an additional 15 minutes at
room temperature. The DNA mixture was then added dropwise to
approximately 3 million cells attached to a 10 cm dish in 5 mls of DMEM.
The cells were incubated 4 hours (37°C, 5% CO2) with the DNA-lipid
mixture, after which DMEM with 20% fetal bovine serum was added to
the dishes to bring the culture medium to 10% fetal bovine serum. The
dishes were incubated at 37°C with 5% CO2.
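
The serum adjustment at the end of the incubation is a simple mass
balance, sketched below. The sketch assumes, as the text suggests but
does not state outright, that the medium on the cells contained no
serum during the 4 hour incubation; the function name and its
parameters are hypothetical.

```python
def volume_to_add(current_volume_ml: float, current_pct: float,
                  stock_pct: float, target_pct: float) -> float:
    """Volume of stock medium (ml) needed to reach the target serum %.

    Solves V*c0 + x*cs = (V + x)*ct for x, where V is the volume on the
    cells, c0 its serum %, cs the stock serum % and ct the target %.
    """
    if not (current_pct < target_pct < stock_pct):
        raise ValueError("target must lie strictly between current and stock")
    return current_volume_ml * (target_pct - current_pct) / (stock_pct - target_pct)


if __name__ == "__main__":
    # 5 ml of serum-free DMEM on the dish, 20% FBS stock, 10% target.
    print(volume_to_add(5.0, 0.0, 20.0, 10.0))
    # Including the ~1.52 ml DNA-lipid mixture (750 + 750 + 20 microliters;
    # the 30 microliters of lipid is ignored here).
    print(volume_to_add(6.52, 0.0, 20.0, 10.0))
```

Under these assumptions, diluting a 20% stock to 10% always requires a
volume equal to what is already on the dish, so roughly 5 to 6.5 ml
would be added per 10 cm dish.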

Plasmid pD2eGFPIresPuroattB has a puromycin gene
transcriptionally linked to the GFP gene via an IRES element. Two days
after the transfection the cells were placed in medium containing
puromycin at 4 µg/ml to select for cells containing the
pD2eGFPIresPuroattB plasmid integrated into the genome. Twenty-three
clones were isolated after 17 days of selection with puromycin. These
clones were expanded and then analyzed for the presence of the GFP
gene on the ACes by 2-color (RFP/biotin & GFP/digoxigenin) TSA-FISH
(NEN) according to the manufacturer's protocol. Sixteen of the 23 clones
produced a positive FISH signal on the ACes with a GFP probe.

EXAMPLE 6 

Delivery of ACes into Human Mesenchymal Stem Cells (hMSC)

A. Transfection

Transfection conditions for the most efficient delivery of the ACes
into hMSCs (Cambrex BioWhittaker Product Code PT-2501, lot# F0658,
East Rutherford, New Jersey) were assayed using LipofectAMINE PLUS
and Superfect. One million prototype B ACes, which is a murine derived
60 Mb ACes having primarily murine pericentric heterochromatin, and
carrying a "payload" containing a hygromycin B selectable marker gene
and a lacZ reporter gene (see, Telenius et al., 1999, Chrom. Res., 7:3-7;
and Kereso et al., 1996, Chrom. Res., 4:226-239; each of which is
incorporated herein by reference in its entirety), were combined with 1-12
µl of the transfection agent. In the case of LipofectAMINE PLUS, the
PLUS reagent was combined with the ACes for 15 minutes followed by
LipofectAMINE for a further 15 minutes. Superfect was complexed for
10 minutes at a ratio of 2 µl Superfect per 1 million ACes. The
ACes/transfection agent complex was then applied to 0.5 million recipient
cells and the transfection was allowed to proceed according to the
manufacturer's protocol. Percent transfected cells was determined on a
FACS Vantage flow cytometer with argon laser tuned to 488 nm at
200 mW and FITC fluorescence collected through a standard FITC 530/30
nm band pass filter. After 24 hours, IdUrd labeled ACes were delivered
to human MSCs in the range of 30-50%, varying with transfection agent
and dose. ACes delivery curves were generated from data collected in
experiments that varied the dose of the transfection reagents. Dose
response curves of Superfect and LipofectAMINE PLUS, showing delivery
of ACes into recipient hMSCs, were prepared, measured by transfer
of IdUrd labeled ACes and detected by flow cytometry. Superfect shows
maximum delivery in the range of 30-50% at doses greater than 2 µl per
million ACes. LipofectAMINE PLUS has a 42-48% delivery peak around
5-8 µl per million ACes. These dose curves were then correlated with
toxicity data to determine the transfection conditions that will allow for
highest potential transfection efficiency. Toxicity was determined by a
modified plating efficiency assay (de Jong et al., 2001, Chrom. Research,
9:475-485). The population's normalized plating efficiency (at maximum
% delivery doses) was in the range of 0.2 - 0.4 for Superfect and 0.5 -
0.6 with LipofectAMINE PLUS.

Because the transfected population consisted of mixed cell types,
flow cytometry allowed for the assessment of ACes delivery into each
sub-population and the purification of the target population. Flow profiles
showing forward scatter (cell size) and side scatter (internal cell
granularity) revealed three distinct hMSC populations that were gated into
three regions: R3 (small cell region), R4 (medium cell region), R5 (large
cell region). Transfection conditions were further optimized by
re-analyzing delivery curves and assessing the differences in delivery to each
sub-population. Dose response curves of Superfect and LipofectAMINE
were prepared showing % delivery to each sub-population represented by
the gating on the basis of cell size and granularity properties of the mixed
population. Three distinct hMSC populations were gated and % delivery
dose curves generated. Using Superfect and LipofectAMINE PLUS the
overall % delivery increased with cell size (80-90% delivery in large cells).
LipofectAMINE PLUS at high doses (8-12 µl per 1 million ACes) shows an
increase in the overall proportion of chromosome transfer to the small
population (10-20%). This suggests an advantage to using this
transfection agent if the small-undifferentiated cell population is the
desired target host cell.

B. Expression from Genes on ACes in hMSCs

Following the delivery screening process conducted in section (A)
above, the most promising results were subjected to further analyses to
monitor expression and verify the presence of structurally intact ACes.
The transfection conditions employed for these experiments were exactly
the same as those that had been used during the screening process.
Short-term expression was monitored by transfecting hMSCs with ACes
containing a RFP gene (red fluorescent protein) set forth in Example 2C as
"D11-C4". The unselected population was harvested at 72-96 hours post
transfection and % positive fluorescent cells measured by flow
cytometry. RFP expression was in the range of 1-20%.

Long-term gene expression was assayed by selecting for
hygromycin B resistant cells over a period of 7-10 days. Cytogenetic
analysis was done to detect presence of intact ACes by fluorescent in
situ hybridization (FISH), where metaphase chromosomes were hybridized
to a mouse major satellite-DNA probe (targeting murine pericentric
heterochromatin) and a lambda probe (hybridizing to the lacZ gene). The
human mesenchymal transfected culture could not undergo standard
sub-cloning as diffuse colonies form with limited doublings available for
expansion. Cytogenetic analysis was performed on the entire population,
sampling over a period of 3-10 days post-transfection. The hygromycin
resistant population was then blocked in mitosis with colchicine and
analyzed for presence of intact ACes by FISH. Preliminary FISH results
show approximately 2-8% of the hMSC-transfected population had an
intact ACes. This compared to rat skeletal muscle myoblast clones,
which were in the range of 60-95%. To increase the % of intact ACes in
the hMSC-transfected population an enrichment step can be utilized as
described in Example 2C.

C. Differentiation of the hMSCs

In initial experiments where transfected hMSCs have been
induced to differentiate into adipocytes or osteocytes, the results indicate
that the transfected cells appear to be differentiating at a rate comparable
to the untransfected controls and the cultures are lineage specific as
tested by microscopic examination, FISH, Oil Red O staining (adipocyte
assay), and calcium secretion (osteocyte assay).

Accordingly, these results indicate that the artificial chromosomes 
{ACes) provided herein can be successfully transferred into hMSC target 

25 cells. Targeting MSCs (such as hMSCs) permits gene transfer into cells in 
an undifferentiated state where the cells are easier to expand and purify. 
The genetically modified cells can then be differentiated in vitro or 
injected into a site in vivo where the microenvironment will induce 
transformation into specific cell lineages. 
EXAMPLE 7

Delivery of a Promoterless Marker Gene to a Platform ACes

Platform ACes containing pSV40attPsensePURO (Figure 4) were
constructed as set forth in Examples 3 and 4.

A. Construction of Targeting Vectors.

The base vector p18attBZeo (3166bp; SEQ ID NO: 114) was
constructed by ligating the 1067bp HindIII-SspI fragment containing
attBZeo, obtained from pLITattBZeo (SEQ ID NO:91), into pUC18 (SEQ ID
NO: 122) digested with HindIII and SspI.

1. p18attBZEO-eGFP (6119bp; SEQ ID NO: 126) was constructed
by inserting the 2977bp SpeI-HindIII fragment from pCXeGFP (SEQ ID
NO:71; Okabe, et al. (1997) FEBS Lett. 407:313-319) containing the eGFP
gene into p18attBZeo (SEQ ID NO: 114) digested with HindIII and XbaI.

2. p18attBZEO-5'6XHS4eGFP (Figure 10; 7631bp; SEQ ID NO:
116) was constructed by ligating the 4465bp HindIII fragment from
pCXeGFPattB(6XHS4)2 (SEQ ID NO: 123), which contains the eGFP gene
under the regulation of the chicken beta actin promoter, 6 copies of the
HS4 core element located 5' of the chicken beta actin promoter and the
polyadenylation signal, into the HindIII site of p18attBZeo (SEQ ID NO:
114).

3. p18attBZEO-3'6XHS4eGFP (Figure 11; 7600bp; SEQ ID NO:
115) was created by removing the 5'6XHS4 element from p18attBZeo-
(6XHS4)2eGFP (SEQ ID NO: 110). p18attBZeo-(6XHS4)2eGFP was
digested with EcoRV and SpeI, treated with Klenow and religated to form
p18attBZeo3'6XHS4eGFP (SEQ ID NO: 115).

4. p18attBZEO-(6XHS4)2eGFP (Figure 12; 9080bp; SEQ ID NO:
110) was created in two steps. First, the EcoRI-SpeI fragment from
pCXeGFPattB(6XHS4)2 (SEQ ID NO: 123), which contains 6 copies of the
HS4 core element, was ligated into p18attBZeo (SEQ ID NO: 114)
digested with EcoRI and XbaI to create p18attBZeo6XHS4 (4615bp; SEQ
ID NO: 117). Next, p18attBZeo6XHS4 was digested with HindIII and
ligated to the 4465bp HindIII fragment from pCXeGFPattB(6XHS4)2,
which contains the eGFP gene under the regulation of the chicken beta
actin promoter, 6 copies of the HS4 core element located 5' of the
chicken beta actin promoter and the polyadenylation signal.
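
The construct sizes quoted above can be cross-checked with simple
arithmetic: ligating a fragment into a vector opened at a single
restriction site should give a product whose length is the sum of the
two. The following Python sketch applies that check to the two HindIII
insertions (items 2 and 4). The dictionary keys and the function name are
illustrative labels only, and the check assumes that no bases are gained
or lost at the junctions.

```python
# Sizes in base pairs, as quoted in the text for each construct.
SIZES_BP = {
    "p18attBZeo": 3166,
    "p18attBZeo6XHS4": 4615,
    "HindIII fragment": 4465,          # from pCXeGFPattB(6XHS4)2
    "p18attBZEO-5'6XHS4eGFP": 7631,
    "p18attBZEO-(6XHS4)2eGFP": 9080,
}


def size_after_insertion(vector_bp: int, fragment_bp: int) -> int:
    """Expected product size when a fragment is ligated into a vector
    linearized at a single site (assumes no bases gained or lost)."""
    return vector_bp + fragment_bp


# Item 2: HindIII fragment into the HindIII site of p18attBZeo.
item2 = size_after_insertion(SIZES_BP["p18attBZeo"], SIZES_BP["HindIII fragment"])

# Item 4, second step: HindIII fragment into p18attBZeo6XHS4.
item4 = size_after_insertion(SIZES_BP["p18attBZeo6XHS4"], SIZES_BP["HindIII fragment"])

print(item2, item2 == SIZES_BP["p18attBZEO-5'6XHS4eGFP"])
print(item4, item4 == SIZES_BP["p18attBZEO-(6XHS4)2eGFP"])
```

Both sums agree with the quoted plasmid sizes, which supports reading
the fragment and vector lengths as given.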

Table 2

Targeting plasmid          No. zeocin   No. clones with   No. clones with correct
                           resistant    expected PCR      sequence at
                           clones       product size      recombination junction

p18attBZEO-eGFP            12           12                NT*
p18attBZEO-5'6XHS4eGFP     11           11                NT
p18attBZEO-3'6XHS4eGFP     11           11                NT
p18attBZEO-(6XHS4)2eGFP    9            9                 4/4

*NT = not tested


B. Transfection and Selection with Drug.

The mouse cell line containing the 2nd generation platform ACE,
B19-38 (constructed as set forth in Example 3), was plated onto four
10cm dishes at approximately 5 million cells per dish. The cells were
incubated overnight in DMEM with 10% fetal calf serum at 37°C and 5%
CO2. The following day the cells in each dish were transfected with 5 µg
of one of the 4 vectors listed in Example 7.A. above and 5 µg of
pCXLamIntR (SEQ ID NO: 112), for a total of 10 µg per 10cm dish.
Lipofectamine Plus reagent was used to transfect the cells according to
the manufacturer's protocol. Two days post-transfection zeocin was added
to the medium at 500 µg/mL. The cells were maintained in selective
medium until colonies formed. The colonies were then ring-cloned (see,
e.g., McFarland, 2000, Methods Cell Sci, Mar;22(1):63-66).
C. Analysis of Clones (PCR, Sequencing).

Genomic DNA was isolated from each of the candidate clones with
the Wizard kit (Promega) following the manufacturer's protocol. The
following primer set was used to analyze the genomic DNA isolated from
the zeocin resistant clones: 5PacSV40 -
CTGTTAATTAACTGTGGAATGTGTGTCAGTTAGGGTG (SEQ ID NO:76);
Antisense Zeo - TGAACAGGGTCACGTCGTCC (SEQ ID NO:77). PCR
amplification with the above primers and genomic DNA from the site-
specific integration of any of the 4 zeocin vectors would result in a 673bp
PCR product.
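
As a quick sanity check on the primer set above, the following Python
sketch reports the length and GC fraction of each primer and looks for a
PacI recognition site (TTAATTAA) in 5PacSV40. That the "Pac" in the
primer name refers to a PacI site is an assumption made here for
illustration, and the helper names are hypothetical.

```python
PRIMERS = {
    "5PacSV40": "CTGTTAATTAACTGTGGAATGTGTGTCAGTTAGGGTG",   # SEQ ID NO:76
    "Antisense Zeo": "TGAACAGGGTCACGTCGTCC",               # SEQ ID NO:77
}

PACI_SITE = "TTAATTAA"  # PacI recognition sequence


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


for name, seq in PRIMERS.items():
    print(f"{name}: {len(seq)} nt, GC = {gc_fraction(seq):.2f}")

# Zero-based position of the PacI site in 5PacSV40 (-1 if absent).
paci_position = PRIMERS["5PacSV40"].find(PACI_SITE)
print("PacI site at index", paci_position)
```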

As set forth in Table 2, of the 4 zeocin resistant candidate clones
thus far analyzed by PCR, all 4 exhibit the correct sequence for a site-
specific integration event.

EXAMPLE 8 

Integration of a PCR product by site-specific recombination. 

In this example a gene is integrated onto the platform ACes by site-
specific recombination without cloning said gene into a vector.

A. PCR PRIMER DESIGN.

PCR primers are designed to contain an attB site at the 5' end of
one of the primers in the primer set. The remaining primers, which could
be one or more than one primer, do not contain an attB site, but are
complementary to sequences flanking the gene or genes of interest and
any associated regulatory sequences. In a first example, 2 primers (one
containing an attB site) are used to amplify a selective gene such as
puromycin.

In a second example as shown in Figure 13, the primer set includes
primers 1 & 2 that amplify the GFP gene without amplification of an
upstream promoter. Primer 1 contains the attB site at the 5' end of the
oligo. Primers 3 & 4 are designed to amplify the IRES-blasticidin DNA
sequences from the vector pIRESblasticidin. The 5' end of primer 3
contains sequences complementary to the 5' end of primer 2 such that
annealing can occur between the 5' ends of the two primers.

B. PCR REACTION AND SUBSEQUENT LIGATION TO CREATE
CIRCULAR MOLECULES FROM THE PCR PRODUCT

In the first example set forth above in Section A, the two PCR
primers are combined with a puromycin DNA template such as pPUR
(Clontech), a heat stable DNA polymerase and appropriate conditions for
DNA amplification. The resulting PCR product (attB-Puromycin) is then
purified and self-ligated to form a circular molecule.

In the second example set forth above in Section A, amplification
of the GFP gene and IRES-blasticidin sequences is accomplished by
combining primers 1 & 2 with DNA template pD2eGFP and primers 3 & 4
with template pIRESblasticidin under appropriate conditions to amplify the
desired template. After initial amplification of the two products (attB-GFP
& IRES-blasticidin) in separate reactions, a second round of amplification
using both of the PCR products from the first round of amplification
together with primers 1 and 4 amplifies the fusion product attB-GFP-IRES-
blasticidin (Figure 13). This technique of using complementary sequences
in primer design to create a fusion product is employed in Saccharomyces
cerevisiae for allele replacement (Erdeniz et al. (1997) Genome Res. 7:1174-
1183). The amplified product is then purified from the PCR reaction
mixture by standard methods and ligated to form a circular molecule.
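
The two-round fusion strategy can be pictured at the sequence level as
joining two first-round products through the region they share. The
following Python sketch is a string-level toy model only: the sequences
are short hypothetical placeholders (not the actual primers, GFP or
IRES-blasticidin sequences), the function name is illustrative, and both
products are written as top strands so that the overlap introduced by
primers 2 and 3 appears at the end of the first product and the start of
the second.

```python
def fuse_by_overlap(left: str, right: str, min_overlap: int = 8) -> str:
    """Join two top-strand amplicon sequences that share an identical
    end-to-start overlap, keeping the shared region only once."""
    longest = min(len(left), len(right))
    for k in range(longest, min_overlap - 1, -1):
        if left[-k:] == right[:k]:
            return left + right[k:]
    raise ValueError("amplicons share no overlap of the required length")


# Hypothetical toy sequences, chosen only to illustrate the joining step.
OVERLAP = "GGATCCGAATTC"                              # primer 2 / primer 3 overlap
attb_gfp = "TGAAGCCTGC" + "ATGGTGAGCAAG" + OVERLAP    # first-round product 1
ires_bsd = OVERLAP + "CCCCTAACGTTA"                   # first-round product 2

fusion = fuse_by_overlap(attb_gfp, ires_bsd)
print(fusion)
print(len(attb_gfp), len(ires_bsd), len(fusion))
```

In this model the fusion product is shorter than the sum of the two
first-round products by exactly the length of the shared overlap.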

C. INTRODUCTION OF PCR PRODUCT ONTO THE ACes USING A
RECOMBINASE

The circular PCR product is then introduced onto the platform
ACes using the bacteriophage lambda integrase E174R. The introduction
can be performed in vivo by transfecting the pCXLamIntR (SEQ ID NO:
112) vector encoding the lambda integrase mutant E174R together with
the circularized PCR product into a cell line containing the platform ACes.

D. SELECTION FOR MARKER GENE

The marker gene (in this case either puromycin, blasticidin or GFP)
is used to enrich the population for cells containing the proper integration
event. A proper integration event in the second example (Figure 14)
juxtaposes a promoter residing on the platform ACes 5' to the attB-GFP-
IRES-Blasticidin PCR product, allowing for transcription of both GFP and
blasticidin. If enrichment is done by drug selection, blasticidin is added to
the medium on the transfected cells 24-48 hours post-transfection.
Selection is maintained until colonies are formed on the plates. If
enrichment is done by cell sorting, cells are sorted 2-4 days post-
transfection to enrich for cells expressing the fluorescent marker (GFP in
this case).

E. ANALYSIS OF CLONES

Clonal isolates are analyzed by PCR, FISH and sequence analysis to
confirm proper integration events.

EXAMPLE 9 

Construction of a human platform ACes "ACE 0.1"

A. CONSTRUCTION OF THE TARGETING VECTOR pPACrDNA

Genome Systems (IncyteGenomics) was supplied with the primers
5'HETS (GGGCCGAAACGATCTCAACCTATT; SEQ ID NO:78) and
3'HETS (CGCAGCGGCCCTCCTACTC; SEQ ID NO:79), which were used
to amplify a 538bp PCR product homologous to nt 9680-10218 of the
human rDNA sequences (GenBank Accession No. U13369). This product
was used as a probe to screen a human genomic PAC (P1 Artificial
Chromosome) library constructed in the vector pCYPAC2 (Ioannou et al.
(1994) Nat. Genet. 6(1): 84-89). Genome Systems clone #18720 was
isolated in this screen and contains three repeats of human rDNA as
assessed by restriction analysis. GS clone #18720 was digested with
PmeI, a restriction enzyme unique to a single repeat of the human rDNA
(45Kbp), and then religated to form pPACrDNA (Figure 15). The insert in
pPACrDNA was analyzed by restriction digests and sequence analysis of
the 5' and 3' termini. The rDNA sequences of pPACrDNA are homologous
to GenBank Accession No. U13369; the insert of about 45 kb comprises a
single repeat beginning from the end of one repeat at ~33980 (relative
to the GenBank sequence) through the beginning of the next repeat up to
approximately 35120 (the repeat offset from that listed in the GenBank
file). Thus, the rDNA sequence is just over 1 copy of the repeat,
extending from 33980 (+/-10bp) to the end of the first repeat (43Kbp)
and continuing into the second repeat to bp 35120 (+/-10bp).
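
The coordinates given above imply an approximate insert length, which
can be worked out as the remainder of the first repeat plus the portion
of the second repeat that is included. The following Python sketch does
this arithmetic under the assumption that one repeat is taken as 43,000
bp (the rounded "43Kbp" figure quoted in the text); the function and
constant names are illustrative.

```python
REPEAT_LENGTH_BP = 43_000   # "43Kbp" as quoted; a rounded value
START_IN_FIRST_REPEAT = 33_980
END_IN_SECOND_REPEAT = 35_120


def wrapped_insert_length(start: int, end_in_next: int, repeat_len: int) -> int:
    """Length of a segment that starts inside one tandem repeat and ends
    inside the next repeat, using the same coordinate system for both."""
    return (repeat_len - start) + end_in_next


insert_bp = wrapped_insert_length(
    START_IN_FIRST_REPEAT, END_IN_SECOND_REPEAT, REPEAT_LENGTH_BP
)
copies = insert_bp / REPEAT_LENGTH_BP

print(insert_bp)            # total rDNA length in the insert, in bp
print(round(copies, 3))     # number of repeat copies represented
```

With these inputs the rDNA segment comes to roughly 44 kb, slightly more
than one repeat, which is in line with the "about 45 kb" and "just over
1 copy" descriptions given that each endpoint carries a +/-10bp
uncertainty and the repeat length is rounded.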

B. TRANSFECTION AND ACes FORMATION. 

Five hundred thousand MSU1.1 cells (Morgan et al., 1991, Exp.
Cell Res., Nov;197(1):125-136; provided by Dr. Justin McCormick at
Michigan State University) were plated per 6cm plate (3 plates total) and
allowed to grow overnight. The cells were 70-80% confluent the
following day. One plate was transfected with 18 µg pPACrDNA
(linearized with Pme I) and 2 µg pSV40attPsensePuro (linearized with Sca
I; see Example 3). The remaining plates were controls and were
transfected with either 20 µg pBS (Stratagene) or 20 µg
pSV40attBsensePuro (linearized with Sca I). All three plates were
transfected using a CaPO4 protocol.

C. SELECTION OF PUROMYCIN RESISTANT COLONIES 

One day post-transfection the cells were "glycerol shocked" by the
addition of PBS medium containing 10% glycerol for 30 seconds.
Subsequently, the glycerol was removed and replaced with fresh DMEM.
Four days post-transfection selective medium was added. Selective
medium contains 1 µg/ml puromycin. The transfection plates were
maintained at 37°C with 5% CO2 in selective medium for 2 weeks, at
which point colonies could be seen on the plate transfected with
pPACrDNA and pSV40attPsensePuro. The colonies were ring-cloned
from the plate on day 17 post-selection and expanded in selective
medium for analysis. Only two colonies (M2-2d & M2-2b) were able to
proliferate in the selective medium after cloning. No colonies were seen
on the control plates after 37 days in selective medium.

D. ANALYSIS OF CLONES

FISH analysis was performed on the candidate clones to detect
ACes formation. Metaphase spreads from the candidate clones were
probed in multiple probe combinations. In one experiment, the probes
used were biotin-labeled human alphoid DNA (pPACrDNA) and
digoxigenin-labeled mouse major DNA (pFK161) as a negative control.
Candidate M2-2d was single cell subcloned by flow sorting and the
candidate subclones were reanalyzed by FISH. Subclone 1B1 of M2-2d
was determined to be a platform ACes and is also designated human
Platform ACE 0.1.

EXAMPLE 10 

Site-specific integration of a marker gene onto a human platform ACE 0.1

The promoterless delivery method was used to deliver a
promoterless blasticidin marker gene onto the human platform ACes. Of
38 blasticidin resistant clones from the population co-transfected with
pLIT38attBBSRpolyA10 and pCXLamIntR (Figure 8; SEQ ID NOs. 111 and
112), 21 displayed a PCR product of the expected size, whereas the
population transfected with pBlueScript resulted in 0 blasticidin
resistant colonies.

A. CONSTRUCTION OF pLIT38attB-BSRpolyA10 & pLIT38attB-
BSRpolyA2.

The vector pLITMUS 38 (New England Biolabs; U.S. Patent No.
5,691,140; SEQ ID NO: 119) was digested with EcoRV and ligated to
two annealed oligomers, which form an attB site (attB1 5'-
TGAAGCCTGCTTTTTTATACTAACTTGAGCGAA-3' (SEQ ID NO:8); attB2
5'-TTCGCTCAAGTTAGTATAAAAAAGCAGGCTTCA-3' (SEQ ID NO:9)).
This ligation reaction resulted in the vector pLIT38attB (SEQ ID NO: 120).
The blasticidin resistance gene and SV40 polyA site were PCR amplified
with primers 5BSD (ACCATGAAAACATTTAACATTTCTCAACA; SEQ ID
NO:80) and SV40polyA (TTTATTTGTGAAATTTGTGATGCTATTGC; SEQ
ID NO:81) using pPAC4 (Frengen, E., et al. (2000) Genomics 68 (2), 118-
126; GenBank Accession No. U75992) as template. The blasticidin-
SV40polyA PCR product was then ligated into pLIT38attB at the BamHI
site, which was Klenow treated following digestion with BamHI.
pLIT38attB-BSDpolyA10 (SEQ ID NO: 111) and pLIT38attB-BSDpolyA2
(SEQ ID NO: 121) are the two resulting orientations of the PCR product
ligated into the vector.
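
Two oligomers can anneal to form a double-stranded attB site only if
each is the reverse complement of the other. The following Python sketch
checks this for the attB1 and attB2 oligomers, with the sequences taken
as TGAAGCCTGCTTTTTTATACTAACTTGAGCGAA and
TTCGCTCAAGTTAGTATAAAAAAGCAGGCTTCA (33 nt each). The function name is
illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


attB1 = "TGAAGCCTGCTTTTTTATACTAACTTGAGCGAA"   # SEQ ID NO:8
attB2 = "TTCGCTCAAGTTAGTATAAAAAAGCAGGCTTCA"   # SEQ ID NO:9

anneal_blunt = reverse_complement(attB1) == attB2
print(len(attB1), len(attB2), anneal_blunt)
```

Because the two oligomers are exact reverse complements of equal length,
the annealed duplex has blunt ends, which is consistent with ligation
into the blunt ends produced by EcoRV.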

B. TRANSFECTION OF MSU 1.1 CELLS CONTAINING HUMAN
PLATFORM ACE 0.1.

MSU1.1 cells containing human platform ACE 0.1 (see Example 9)
were expanded and plated to five 10cm dishes with 1.3x10^ cells per
dish. The cells were incubated overnight in DMEM with 10% fetal bovine
serum, at 37°C and 5% CO2. The following day the cells were
transfected with 5 µg of each plasmid as set forth in Table 3, for a total
of 10 µg of DNA per plate of cells transfected (see Table 3), using ExGen
500 in vitro transfection reagent (MBI Fermentas, cat. no. R0511). The
transfection was performed according to the manufacturer's protocol.
Cells were incubated at 37°C with 5% CO2 in DMEM with 10% fetal
bovine serum following the transfection.
Table 3

Plate #   Plasmid 1     Plasmid 2                No. BsdR Colonies

1         pBS           None                     0
2         pCXLamInt     pLIT38attB-BSRpolyA10    16
3         pCXLamIntR    pLIT38attB-BSRpolyA10    40
4         pCXLamInt     pLIT38attB-BSRpolyA2     28
5         pCXLamIntR    pLIT38attB-BSRpolyA2     36


C. SELECTION OF BLASTICIDIN RESISTANT CLONES.

Three days following the transfection the cells were split from a 10
cm dish to two 15cm dishes. The cells were maintained in DMEM with
10% fetal bovine serum for 4 days in the 15 cm dishes. Seven days
post-transfection blasticidin was introduced into the medium. Stably
transfected cells were selected with 1 µg/ml blasticidin. The number of
colonies formed on each plate is listed in Table 3. These colonies were
ring-cloned and expanded for PCR analysis. Upon expansion in blasticidin
containing medium some clones failed to live and therefore do not have
corresponding PCR data.

D. PCR ANALYSIS

Thirty-eight of the 40 clones from plate 3 grew after ring-cloning.
Genomic DNA was isolated from these clones with the Promega Wizard
Genomic DNA purification kit, digested with EcoRI and used as template
in a PCR reaction with the following primers: 3BSP -
TTAATTTCGGGTATATTTGAGTGGA (SEQ ID NO:82); 5PacSV40 -
CTGTTAATTAACTGTGGAATGTGTGTCAGTTAGGGTG (SEQ ID NO:76).
The PCR conditions were as follows. 100ng of genomic DNA was
amplified with 0.5 µl Herculase polymerase (Stratagene) in a 50 µl reaction
that contained 12.5pmole of each primer, 2.5mM of each dNTP, and 1X
Herculase buffer (Stratagene). The reactions were placed in a PerkinElmer
thermocycler programmed as follows: initial denaturation at 95°C for 10
minutes; 35 cycles of 94°C for 1 minute, 53°C for 1 minute, 72°C for 1
minute, and 72°C for 1 minute; final extension for 10 minutes at 72°C;
and 4°C hold. If pLIT38attB-BSRpolyA10 integrates onto the human
platform ACE 0.1 correctly, PCR amplification with the above primers
should yield an 804bp product. Twenty-one of the 38 clones from plate
3 produced a PCR product of the expected 804bp size.
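
The counts reported for plate 3 can be expressed as percentages. The
following Python sketch computes the fraction of ring-cloned colonies
that grew (38 of 40) and the fraction of analyzed clones giving the
expected 804bp product (21 of 38). The function name is illustrative
and the calculation uses only the counts stated above.

```python
def percent(numerator: int, denominator: int) -> float:
    """Return numerator/denominator as a percentage."""
    if denominator == 0:
        raise ValueError("denominator must be non-zero")
    return 100.0 * numerator / denominator


colonies_plate3 = 40      # blasticidin resistant colonies on plate 3
clones_grown = 38         # clones that grew after ring-cloning
clones_expected_pcr = 21  # clones with the expected 804bp product

survival_pct = percent(clones_grown, colonies_plate3)
pcr_positive_pct = percent(clones_expected_pcr, clones_grown)

print(f"grew after ring-cloning: {survival_pct:.1f}%")
print(f"expected PCR product:    {pcr_positive_pct:.1f}%")
```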

EXAMPLE 11 

Delivery of a Vector comprising a Promoterless Marker Gene and a gene 
encoding a therapeutic product to a Platform ACes 

Platform ACes containing pSV40attPsensePURO (Figure 4) were
constructed as set forth in Examples 3 and 4.

A. CONSTRUCTION OF DELIVERY VECTORS

1. Erythropoietin cDNA vector, p18EPOcDNA.

The erythropoietin cDNA was PCR amplified from a human cDNA
library (E. Perkins et al., 1999, Proc. Natl. Acad. Sci. USA 96(5): 2204-
2209) using the following primers: EPO5XBA -
TATCTAGAATGGGGGTGCACGAATGTCCTGCC (SEQ ID NO: 83);
EPO3BSI - TACGTACGTCATCTGTCCCCTGTCCTGCAGGC (SEQ ID NO:
84). The cDNA was amplified through two successive rounds of PCR
using the following conditions: heat denaturation at 95°C for 3 minutes;
35 cycles of a 30 second denaturation (95°C), 30 seconds of annealing
(60°C), and 1 minute extension (72°C); the last cycle is followed by a 7
minute extension at 72°C. BIO-X-ACT (BIOLINE) was used to amplify the
erythropoietin cDNA from 2.5ng of the human cDNA library in the first
round of amplification. Five µl of the first amplification product was used
as template for the second round of amplification. Two PCR products
were produced from the second amplification with Taq polymerase
(Eppendorf); each product was cloned into pCR2.1-Topo (Invitrogen) and
sequenced. The larger PCR product contained the expected cDNA
sequence for erythropoietin. The erythropoietin cDNA was moved from
pTopoEPO into p18attBZeo(6XHS4)2eGFP (SEQ ID NO: 110). pTopoEPO
was digested with BsiWI and XbaI to release a 588 bp EPO cDNA. BsrGI
and BsiWI create compatible ends. The eGFP gene was removed from
p18attBZeo(6XHS4)2eGFP by digestion with BsiWI and XbaI, and the 8.3
Kbp vector backbone was gel purified and ligated to the 588 bp EPO cDNA
to create p18EPOcDNA (SEQ ID NO: 124).

2. Genomic erythropoietin vector, p18genEPO.

The erythropoietin genomic clone was PCR amplified from a human
genomic library (Clontech) using the following primers: GENEPO3BSI -
CGTACGTCATCTGTGGGCTGTCCTGCA (SEQ ID NO: 85); GENEPO5XBA -
TCTAGAATGGGGGTGGACGGTGAGTACT (SEQ ID NO: 86). The
reaction conditions for the amplification were as follows: heat
denaturation for 3 minutes (95°C); 30 cycles of a 30 second denaturation
(95°C), 30 seconds annealing (from 65°C decreasing 0.5°C per cycle to
50°C), and 3 minutes extension (72°C); 15 cycles of a 30 second
denaturation (95°C), 30 seconds annealing (50°C), and 3 minute
extension (72°C); the last cycle is followed by a 7 minute extension at
72°C. The erythropoietin genomic PCR product (2147 bp) was gel
purified and cloned into pCR2.1-Topo to create pTopogenEPO. Sequence
analysis revealed 2 bp substitutions and insertions in the intronic
sequences of the genomic clone of erythropoietin. A partial digest with
XbaI and complete digest with BsiWI excised the erythropoietin genomic
insert from pTopogenEPO. The resulting 2158 bp genomic erythropoietin
fragment was ligated into the 8.3 Kbp fragment resulting from the
digestion of p18attBZeo(6XHS4)2eGFP (SEQ ID NO: 110) with XbaI and
BsrGI to create p18genEPO (SEQ ID NO: 125).
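
The amplification program described above for the genomic clone is a
"touchdown" profile, in which the annealing temperature is lowered in
small steps before being held constant. The following Python sketch
lists the annealing temperature for each cycle, assuming the touchdown
starts at 65°C (the value used for the equivalent program in the PCR
analysis of this Example), that the first cycle anneals at the starting
temperature, and that each later cycle is 0.5°C lower until the 50°C
floor is reached. The function and parameter names are illustrative.

```python
def touchdown_annealing(
    start_c: float = 65.0,
    step_c: float = 0.5,
    floor_c: float = 50.0,
    touchdown_cycles: int = 30,
    fixed_cycles: int = 15,
) -> list[float]:
    """Annealing temperature (deg C) for every cycle of a touchdown PCR:
    a stepped-down phase followed by a phase held at the floor."""
    stepped = [max(start_c - step_c * i, floor_c) for i in range(touchdown_cycles)]
    held = [floor_c] * fixed_cycles
    return stepped + held


schedule = touchdown_annealing()
print(len(schedule))            # total number of cycles
print(schedule[0], schedule[29], schedule[30])
```

Under these assumptions the thirtieth touchdown cycle anneals at 50.5°C
and the following 15 cycles are held at 50°C, so a 0.5°C decrement over
30 cycles spans the stated 65°C to 50°C range.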

B. TRANSFECTION AND SELECTION WITH DRUG 

The erythropoietin genomic and cDNA genes were each moved
onto the platform ACes B19-38 (constructed as set forth in Example 3) by
co-transfecting with pCXLamIntR. Control transfections were also
performed using pCXLamInt (SEQ ID NO: 127) together with either
p18EPOcDNA (SEQ ID NO: 124) or p18genEPO (SEQ ID NO: 125).
Lipofectamine Plus was used to transfect the DNAs into B19-38 cells
according to the manufacturer's protocol. The cells were placed in
selective medium (DMEM with 10% FBS and Zeocin at 500 µg/ml) 48
hours post-transfection and maintained in selective medium for 13 days.
Clones were isolated 15 days post-transfection.

C. ANALYSIS OF CLONES (ELISA, PCR)

1. ELISA Assays

Thirty clones were tested for erythropoietin production by an ELISA
assay using a monoclonal anti-human erythropoietin antibody (R&D
Systems, Catalogue # MAB287), a polyclonal anti-human erythropoietin
antibody (R&D Systems, Catalogue # AB-286-NA) and alkaline
phosphatase conjugated goat-anti-rabbit IgG (heavy and light chains)
(Jackson ImmunoResearch Laboratories, Inc., Catalogue # 111-055-144).
The negative control was a Zeocin resistant clone isolated from B19-38
cells transfected with p18attBZeo(6XHS4) (SEQ ID NO: 117; no insert
control vector) and pCXLamIntR (SEQ ID NO: 112). The preliminary
ELISA assay was executed as follows: 1) Nunc-Immuno Plates (MaxiSorb
96-well, Catalogue # 439454) were coated with 75 µl of a 1/200 dilution
(in Phosphate buffered Saline, pH 7.4 (PBS), Sigma Catalogue # P-3813)
of monoclonal anti-human erythropoietin antibody overnight at 4°C. 2)
The following day the plates were washed 3 times with 300 µl PBS
containing 0.15% Tween 20 (Sigma, Catalogue # P-9416). 3) The plates
were then blocked with 300 µl of 1% Bovine Serum Albumin (BSA; Sigma
Catalogue # A-7030) in PBS for 1 hour at 37°C. 4) Repeat the washes as
in step 2. 5) The clonal supernatants (75 µl per clone per well of 96-well
plate) were then added to the plate and incubated for 1 hour at 37°C.
The clonal supernatant analyzed in the ELISA assay had been maintained
on the cells 7 days prior to analysis. 6) Repeat the washes of step 2. 7)
Add 75 µl of polyclonal anti-human erythropoietin antibody (1/250 dilution
in dilution buffer (0.5% BSA, 0.01% Tween 20, 1X PBS, pH 7.4)) and
incubate 1 hour at 37°C. 8) Repeat washes of step 2. 9) Add 75 µl of
goat anti-rabbit conjugated alkaline phosphatase diluted 1/4000 in dilution
buffer and incubate 1 hour at 37°C. 10) Repeat washes of step 2. 11)
Add 75 µl substrate, p-nitrophenyl phosphate (Sigma N2640), diluted to
1mg/ml in substrate buffer (0.1 Ethanolamine-HCl (Sigma, Catalogue # E-
6133), 5mM MgCl2 (Sigma, Catalogue # M-2393), pH 9.8). Incubate the
plates in the dark for 1 hour at room temperature (22°C). 12) Read the
absorption at 405nm (reference wavelength 495nm) on a Universal
Microplate Reader (Bio-Tek Instruments, Inc., model # ELX800 UV). The
erythropoietin standard curve was derived from readings of diluted human
recombinant Erythropoietin (Roche, catalogue # 1-120-166; dilution range
125 - 7.8mUnits/ml). From this preliminary assay the 21 clones
displaying the highest expression of erythropoietin were analyzed a
second time in the same manner using medium supernatants that had
been on the clones for 24 hours and a 1:3 dilution thereof.
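
The text gives only the end points of the erythropoietin standard range
(125 to 7.8 mUnits/ml). One dilution scheme consistent with those end
points is a two-fold serial dilution with five standards; this is an
assumption made for illustration, since the number of standards and the
dilution factor are not stated. The following Python sketch generates
such a series, using hypothetical function and variable names.

```python
def serial_dilution(top: float, factor: float, points: int) -> list[float]:
    """Concentrations of a serial dilution starting at `top`, each
    subsequent standard diluted by `factor`."""
    return [top / factor**i for i in range(points)]


# Assumed scheme: two-fold steps, five standards (not stated in the text).
standards_mU_per_ml = serial_dilution(top=125.0, factor=2.0, points=5)

for conc in standards_mU_per_ml:
    print(f"{conc:.1f} mUnits/ml")
```

Under this assumption the lowest standard is 125/16 mUnits/ml, which
rounds to the 7.8 mUnits/ml lower limit quoted for the standard curve.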

2. PCR Analysis

Genomic DNA was isolated from the 21 clones with the best
expression (as assessed by the initial ELISA assay above) as well as the
B19-38 cell line and used for PCR analysis. Genomic DNA was isolated
using the Wizard genomic DNA purification kit (Promega) according to the
manufacturer's protocol. Amplification was performed on 100ng of
genomic DNA as template with MasterTaq DNA Polymerase (Eppendorf)
and the primer set 5PacSV40 -
CTGTTAATTAACTGTGGAATGTGTGTCAGTTAGGGTG (SEQ ID NO: 76) and
Antisense Zeo - TGAACAGGGTCACGTCGTCC (SEQ ID NO: 77). The
amplification conditions were as follows: heat denaturation for 3 minutes
(95°C); 30 cycles of a 30 second denaturation (95°C), 30 seconds
annealing (from 65°C decreasing 0.5°C per cycle to 50°C), and 1 minute
extension (72°C); 15 cycles of a 30 second denaturation (95°C), 30
seconds annealing (50°C), and 1 minute extension (72°C); the last cycle
is followed by a 10 minute extension at 72°C. PCR products were size
separated by gel electrophoresis. Of the 21 clones analyzed, 19 produced
a PCR product of 650 bp as expected for a site-specific integration event.
All nineteen clones were the result of transformations with p18EPOcDNA
(5) or p18genEPO (14) and pCXLamIntR (i.e., mutant integrase). The
remaining two clones, both of which were the result of transformation
with p18genEPO (SEQ ID NO: 125) and pCXLamInt (i.e., wildtype
integrase; SEQ ID NO: 127), produced a 400 bp PCR product.



Preparation of a Transformation Vector Useful for the Induction of Plant 
Artificial Chromosome Formation 

Plant artificial chromosomes (PACs) can be generated by
introducing nucleic acid, such as DNA, which can include a targeting
DNA, for example rDNA or lambda DNA, into a plant cell, allowing the cell
to grow, and then identifying from among the resulting cells those that
include a chromosome with a structure that is distinct from that of any
chromosome that existed in the cell prior to introduction of the nucleic
acid. The structure of a PAC reflects amplification of chromosomal DNA,
for example, segmented, repeat region-containing and heterochromatic
structures. It is also possible to select cells that contain structures that
are precursors to PACs, for example, chromosomes containing more than
one centromere and/or fragments thereof, and culture and/or manipulate
them to ultimately generate a PAC within the cell.

In the method of generating PACs, the nucleic acid can be
introduced into a variety of plant cells. The nucleic acid can include
targeting DNA and/or a plant expressible DNA encoding one or multiple
selectable markers (e.g., DNA encoding bialaphos (bar) resistance) or
scorable markers (e.g., DNA encoding GFP). Examples of targeting DNA
include, but are not limited to, N. tabacum rDNA intergenic spacer
sequence (IGS) and Arabidopsis rDNA such as the 18S, 5.8S, 26S rDNA
and/or the intergenic spacer sequence. The DNA can be introduced using
a variety of methods, including, but not limited to, Agrobacterium-
mediated methods, PEG-mediated DNA uptake and electroporation using,
for example, standard procedures according to Hartmann et al. [(1998)
Plant Molecular Biology 55:741]. The cell into which such DNA is
introduced can be grown under selective conditions, or can initially be
grown under non-selective conditions and then transferred to selective
media. The cells or protoplasts can be placed on plates containing a
selection agent to grow, for example, individual calli. Resistant calli can
be scored for scorable marker expression. Metaphase spreads of resistant
cultures can be prepared, and the metaphase chromosomes examined by
FISH analysis using specific probes in order to detect amplification of
regions of the chromosomes. Cells that have artificial chromosomes with
functioning centromeres or artificial chromosomal intermediate structures,
including, but not limited to, dicentric chromosomes, formerly dicentric
chromosomes, minichromosomes, heterochromatin structures (e.g.,
sausage chromosomes), and stable self-replicating artificial chromosomal
intermediates as described herein, are
identified and cultured. In particular, the cells containing self-replicating
artificial chromosomes are identified.

The DNA introduced into a plant cell for the generation of PACs
can be in any form, including in the form of a vector. An exemplary
vector for use in methods of generating PACs can be prepared as follows.
For the production of artificial chromosomes, plant transformation
vectors, as exemplified by pAgIIa and pAgIIb, containing a selectable
marker, a targeting sequence, and a scorable marker were constructed
using procedures well known in the art to combine the various fragments.
The vectors can be prepared using vector pAg1 as a base vector and
inserting the following DNA fragments into pAg1: DNA encoding β-
glucuronidase under the control of the nopaline synthase (NOS) promoter
fragment and flanked at the 3' end by the NOS terminator fragment, a
fragment of mouse satellite DNA and an N. tabacum rDNA intergenic
spacer sequence (IGS). In constructing plant transformation vectors,
vector pAg2 can also be used as the base vector.
1. Construction of pAG1

Vector pAg1 (SEQ. ID. NO: 89) is a derivative of the CAMBIA
vector named pCambia 3300 (Center for the Application of Molecular
Biology to International Agriculture, i.e., CAMBIA, Canberra, Australia;
www.cambia.org), which is a modified version of vector pCambia 1300
to which has been added DNA from the bar gene conferring resistance to
phosphinothricin. The nucleotide sequence of pCambia 3300 is provided
in SEQ. ID. NO: 90. pCambia 3300 also contains a lacZ alpha sequence
containing a polylinker region.

pAg1 was constructed by inserting two new functional DNA
fragments into the polylinker of pCambia 3300: one sequence containing
an attB site and a promoterless zeomycin resistance-encoding DNA
flanked at the 3' end by a SV40 polyA signal sequence, and a second
sequence containing DNA from the hygromycin resistance gene
(hygromycin phosphotransferase) conferring resistance to hygromycin for
selection in plants. Although the zeomycin-SV40 polyA signal fusion is
not expected to function in plant cells, it can be activated in mammalian
cells by insertion of a functional promoter element into the attB site by
site-specific recombination catalyzed by the Lambda att integrase. Thus,
the inclusion of the attB-zeomycin sequences allows for evaluation of
functionality of plant artificial chromosomes in mammalian cells by
activation of the zeomycin resistance-encoding DNA, and provides an att
site for further insertion of new DNA sequences into plant artificial
chromosomes formed as a result of using pAg1 for plant transformation.
The second functional DNA fragment allows for selection of plant cells
with hygromycin. Thus, pAg1 contains DNA from the bar gene conferring
resistance to phosphinothricin, DNA from the hygromycin resistance gene,
both resistance-encoding DNAs under the control of a separate
cauliflower mosaic virus (CaMV) 35S promoter, and the attB-promoterless
zeomycin resistance-encoding DNA.

pAg1 is a binary vector containing Agrobacterium right and left T-
DNA border sequences for use in Agrobacterium-mediated transformation
of plant cells or protoplasts with the DNA located between the border
sequences. pAg1 also contains the pBR322 Ori for replication in E. coli.
pAg1 was constructed by ligating HindIII/PstI-digested p3300attBZeo
with HindIII/PstI-digested pBSCaMV35SHyg as follows.

a. Generation of p3300attBZeo

Plasmid pCambia 3300 was digested with PstI/Ecl136II and ligated
with PstI/StuI-digested pLITattBZeo (the nucleotide sequence of
pLITattBZeo is provided in SEQ. ID. NO: 91), which contains DNA
encoding the zeocin resistance gene and an attB integrase recognition
sequence, to generate p3300attBZeo, which contains an attB site, a
promoterless zeomycin resistance-encoding DNA flanked at the 3' end by
a SV40 polyA signal, and a reconstructed PstI site.
b. Generation of pBSCaMV35SHyg

A DNA fragment containing DNA encoding hygromycin
phosphotransferase flanked by the CaMV 35S promoter and the CaMV
35S polyA signal sequence was obtained by PCR amplification of plasmid
pCambia 1302 (GenBank Accession No. AF234298 and SEQ. ID. NO:
92). The primers used in the amplification reaction were as follows:
CaMV35SpolyA:

5'-CTGAATTAACGCCGAATTAATTCGGGGGATCTG-3' SEQ. ID. NO: 93
CaMV35Spr:

5'-CTAGAGCAGCTTGCCAACATGGTGGAGCA-3' SEQ. ID. NO: 94
The 2100-bp PCR fragment was ligated with EcoRV-digested pBluescript
II SK+ (Stratagene, La Jolla, CA, U.S.A.) to generate pBSCaMV35SHyg.

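As a quick check on the two amplification primers listed above (SEQ. ID. NOS: 93 and 94), the following Python sketch computes their length and GC content. The helper name `primer_stats` is hypothetical and used for illustration only; the sketch makes no assumption about the annealing or cycling conditions actually used.

```python
# Illustrative only: basic properties of the primers given as
# SEQ. ID. NOS: 93 and 94. `primer_stats` is a hypothetical helper.
PRIMERS = {
    "CaMV35SpolyA": "CTGAATTAACGCCGAATTAATTCGGGGGATCTG",
    "CaMV35Spr": "CTAGAGCAGCTTGCCAACATGGTGGAGCA",
}


def primer_stats(seq):
    """Return (length, GC count, GC fraction) for a DNA primer."""
    seq = seq.upper()
    gc = sum(1 for base in seq if base in "GC")
    return len(seq), gc, gc / len(seq)


for name, seq in PRIMERS.items():
    length, gc, frac = primer_stats(seq)
    print(f"{name}: {length} nt, GC {gc}/{length} ({frac:.1%})")
```

The same helper can be applied to any of the other primers in this example to compare them before use.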
c. Generation of pAgI 

To generate pAg1, pBSCaMV35SHyg was digested with
HindIII/PstI and ligated with HindIII/PstI-digested p3300attBZeo. Thus,
pAg1 contains the pCambia 3300 backbone with DNA conferring
resistance to phosphinothricin and hygromycin under the control of
separate CaMV 35S promoters, an attB-promoterless zeomycin
resistance-encoding DNA recombination cassette and unique sites for
adding additional markers, e.g., DNA encoding GFP. The attB site can be
used as described herein for the addition of new DNA sequences to plant
artificial chromosomes, including PACs formed as a result of using the
pAg1 vector, or derivatives thereof, in the production of PACs. The attB
site provides a convenient site for recombinase-mediated insertion of
DNAs containing a homologous att site.
2. pAg2
The vector pAg2 (SEQ. ID. NO: 95) is a derivative of vector pAg1
formed by adding DNA encoding a green fluorescent protein (GFP), under
the control of a NOS promoter and flanked at the 3' end by a NOS polyA
signal, to pAg1. pAg2 was constructed as follows. A DNA fragment
containing the NOS promoter was obtained by digestion of pGEM-T-NOS,
or pGEMEasyNOS (SEQ. ID. NO: 96), containing the NOS promoter in the
cloning vector pGEM-T-Easy (Promega Biotech, Madison, WI, U.S.A.),
with XbaI/NcoI and was ligated to an XbaI/NcoI fragment of pCambia 1302
containing DNA encoding GFP (without the CaMV 35S promoter) to
generate p1302NOS (SEQ. ID. NO: 97) containing GFP-encoding DNA in
operable association with the NOS promoter. Plasmid p1302NOS was
digested with SmaI/BsiWI to yield a fragment containing the NOS
promoter and GFP-encoding DNA. The fragment was ligated with
PmeI/BsiWI-digested pAg1 to generate pAg2. Thus, pAg2 contains DNA
from the bar gene conferring resistance to phosphinothricin, DNA
conferring resistance to hygromycin, both resistance-encoding DNAs
under the control of a cauliflower mosaic virus 35S promoter, DNA
encoding kanamycin resistance, a GFP gene under the control of a NOS
promoter and the attB-zeomycin resistance-encoding DNA. One of skill in
the art will appreciate that other fragments can be used to generate the
pAg1 and pAg2 derivatives and that other heterologous DNA can be
incorporated into pAg1 and pAg2 derivatives using methods well known
in the art.

3. pAgIIa and pAgIIb transformation vectors

Vectors pAgIIa and pAgIIb were constructed by inserting the
following DNA fragments into pAg1: DNA encoding β-glucuronidase, the
nopaline synthase terminator fragment, the nopaline synthase (NOS)
promoter fragment, a fragment of mouse satellite DNA and an N. tabacum
rDNA intergenic spacer sequence (IGS). The construction of pAgIIa and
pAgIIb was as follows.

An N. tabacum rDNA intergenic spacer (IGS) sequence (SEQ. ID.
NO: 98; see also GenBank Accession No. Y08422; see also Borysyuk et
al. (2000) Nature Biotechnology 18:1303-1306; Borysyuk et al. (1997)
Plant Mol. Biol. 35:655-660; U.S. Patent Nos. 6,100,092 and 6,355,860)
was obtained by PCR amplification of tobacco genomic DNA. The IGS
can be used as a targeting sequence by virtue of its homology to tobacco
rDNA genes; the sequence is also an amplification promoter sequence in
plants. This fragment was amplified using standard PCR conditions (e.g.,
as described by Promega Biotech, Madison, WI, U.S.A.) from tobacco
genomic DNA using the primers shown below:
NTIGS-F1

5'-GTG CTA GCC AAT GTT TAA CAA GAT G-3' (SEQ ID No. 99) and
NTIGS-R1

5'-ATG TCT TAA AAA AAA AAA CCC AAG TGA C-3' (SEQ ID No. 100)
Following amplification, the fragment was cloned into pGEM-T Easy to
give pIGS-1. A fragment of mouse satellite DNA (Msat1 fragment;
GenBank Accession No. V00846; and SEQ ID No. 101) was amplified via
PCR from pSAT-1 using the following primers:
MSAT-F1

5'-AAT ACC GCG GAA GCT TGA CCT GGA ATA TCG C-3' (SEQ ID No.
102) and
MSAT-R1

5'-ATA ACC GCG GAG TCC TTC AGT GTG CAT-3' (SEQ ID No. 103)
This amplification added a SacII and a HindIII site at the 5' end and a SacII
site at the 3' end of the PCR fragment. This fragment was then cloned
into the SacII site in pIGS-1 to give pMIGS-1, providing a eukaryotic
centromere-specific DNA and a convenient DNA sequence for detection
via FISH.

A functional marker gene containing a NOS-promoter:GUS:NOS
terminator fusion was then constructed containing the NOS promoter
(GenBank Accession No. U09365; SEQ ID No. 104), E. coli
β-glucuronidase coding sequence (from the GUS gene; GenBank
Accession No. S69414; and SEQ ID No. 105), and the nopaline synthase
terminator sequence (GenBank Accession No. U09365; SEQ ID No. 107).
The NOS promoter in pGEM-T-NOS was added to a promoterless GUS
gene in pBluescript (Stratagene, La Jolla, CA, U.S.A.) using NotI/SpeI to
form pNGN-1, which has the NOS promoter in the opposite orientation
relative to the GUS gene.

pMIGS-1 was digested with NotI/SpeI to yield a fragment
containing the mouse major satellite DNA and the tobacco IGS, which was
then added to NotI-digested pNGN-1 to yield pNGN-2. The NOS promoter
was then re-oriented to provide a functional GUS gene, yielding pNGN-3,
by digestion and religation with SpeI. Plasmid pNGN-3 was then digested
with HindIII, and the HindIII fragment containing the β-glucuronidase
coding sequence and the rDNA intergenic spacer, along with the Msat
sequence, was added to pAg1 to form pAgIIa (SEQ ID NO: 108), using
the unique HindIII site in pAg1 located near the right T-DNA border of
pAg1, within the T-DNA region.

Another plasmid vector, referred to as pAgIIb, was also recovered,
which contained the inserted HindIII fragment (SEQ ID NO: 108) in the
opposite orientation relative to that observed in pAgIIa. Thus, pAgIIa and
pAgIIb differ only in the orientation of the HindIII fragment containing the
mouse major satellite sequence, the GUS DNA sequence and the IGS
sequence. The nucleotide sequence of pAgIIa is provided in SEQ. ID. NO:
109. 
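
Because both ends of the inserted fragment carry the same HindIII site, ligation into a single HindIII site in the vector can occur in either orientation, which is consistent with two vectors being recovered. The following Python sketch illustrates the relationship, under the assumption that the two orientations are modeled simply as the fragment and its reverse complement. All sequences are made-up placeholders, not the sequences of the SEQ ID NOs, and the helper names are hypothetical.

```python
# Illustrative sketch: a fragment with identical sites at both ends can ligate
# in either orientation. All sequences below are made-up placeholders.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def insert_at_site(backbone, site, fragment, flipped=False):
    """Insert `fragment` at the single occurrence of `site` in `backbone`.

    The site is regenerated on both sides of the insert; `flipped=True`
    models the opposite orientation of the insert.
    """
    if backbone.count(site) != 1:
        raise ValueError("site must occur exactly once in the backbone")
    left, right = backbone.split(site)
    body = reverse_complement(fragment) if flipped else fragment
    return left + site + body + site + right


HINDIII = "AAGCTT"                     # palindromic recognition sequence
backbone = "GGGG" + HINDIII + "CCCC"   # placeholder vector
fragment = "ATGGCATTTC"                # placeholder insert, not a real sequence

forward = insert_at_site(backbone, HINDIII, fragment)
reverse = insert_at_site(backbone, HINDIII, fragment, flipped=True)
print(forward)
print(reverse)
```

Since the recognition sequence is its own reverse complement, both products retain intact sites on either side of the insert and differ only in the orientation of the internal fragment.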
Since modifications will be apparent to those of skill in this art, it is
intended that this invention be limited only by the scope of the appended
claims.

WHAT IS CLAIMED IS:

1. A eukaryotic chromosome comprising one or a plurality of att
site(s), wherein:

an att site is heterologous to the chromosome; and

an att site permits site-directed integration in the presence of
lambda integrase.

2. The eukaryotic chromosome of claim 1, wherein the att sites
are selected from the group consisting of attP and attB or attL and attR,
or variants thereof.

3. The eukaryotic chromosome of claim 1 that is an artificial
chromosome.

4. The eukaryotic chromosome of claim 1 that is an artificial
chromosome expression system (ACes).

5. The eukaryotic chromosome of claim 4 that is predominantly
heterochromatin.

6. The chromosome of claim 1 that is an artificial chromosome 
that contains no more than about 30%, 40%, 50%, 60%, 70%, 80%, 
90% or 95% euchromatin. 

7. The chromosome of claim 1 that is a plant chromosome.

8. The chromosome of claim 1 that is an animal chromosome.

9. The chromosome of claim 7 that is a plant artificial
chromosome.

10. The chromosome of claim 8 that is an animal artificial
chromosome.

11. The chromosome of claim 8 that is a mammalian
chromosome.

12. The chromosome of claim 11 that is a mammalian artificial
chromosome.

13. The chromosome of claim 6 that is an artificial chromosome
expression system (ACes).

14. A platform artificial chromosome expression system (ACes)
comprising one or a plurality of sites that participate in
recombinase-catalyzed recombination.

15. The ACes of claim 14 that contains one site.

16. The ACes of claim 14 that is predominantly heterochromatin.

17. The ACes of claim 14 that contains no more than about
30%, 40%, 50%, 60%, 70%, 80%, 90% or 95% euchromatin.

18. The ACes of claim 14 that is a plant ACes.

19. The ACes of claim 14 that is an animal ACes.

20. The ACes of claim 14 that is selected from a fish, insect,
reptile, amphibian, arachnid or a mammalian ACes.

21. The ACes of claim 14 that is a fish ACes.

22. The artificial chromosome expression system (ACes) of claim
14, wherein the recombinase and site(s) are from the Cre/lox system of
bacteriophage P1, the int/att system of lambda phage, the FLP/FRT
system of yeast, the Gin/gix recombinase system of phage Mu, the Cin
recombinase system, the Pin recombinase system of E. coli and the R/RS
system of the pSR1 plasmid, or any combination thereof.

23. A method of introducing heterologous nucleic acid into a
chromosome, comprising:

contacting a chromosome of any of claims 1 or 14 with a nucleic
acid molecule comprising both the heterologous nucleic acid and a
recombination site, in the presence of a recombinase that promotes
recombination between the sites in the chromosome and in the nucleic
acid molecule.

24. The method of claim 23, wherein the recombinase is
selected from the group consisting of Cre, Gin, Cin, Pin, FLP, a phage
integrase and R from the pSR1 plasmid.

25. The method of claim 23, wherein the nucleic acid molecule
encodes a therapeutic protein, antisense nucleic acid, or comprises an
artificial chromosome.

26. The method of claim 25, wherein the nucleic acid molecule
comprises a yeast artificial chromosome (YAC), a bacterial artificial
chromosome (BAC) or an insect artificial chromosome (IAC).

27. A combination, comprising the chromosome of claim 1 and a
first vector comprising the cognate recombination site, wherein the
cognate recombination site is a site that recombines with the site
engineered into the chromosome.

28. The combination of claim 27, further comprising nucleic acid
encoding a recombinase, wherein the nucleic acid is on a second vector
or on the first vector, or on the ACes under an inducible promoter.

29. The combination of claim 28, wherein the recombinase and
sites are from the Cre/lox system of bacteriophage P1, the int/att system
of lambda phage, the FLP/FRT system of yeast, the Gin/gix recombinase
system of phage Mu, the Pin recombinase system of E. coli and the R/RS
system of the pSR1 plasmid, or any combination thereof.

30. The combination of claim 28, wherein a vector is the plasmid
pCXLamIntR.

31. The combination of claim 27, wherein a vector is the plasmid
pDsRedN1-attB.

32. A kit, comprising the combination of claim 27 and optionally
instructions for introducing heterologous nucleic acid into the
chromosome.

33. A method for introducing heterologous nucleic acid into a
platform artificial chromosome, comprising:

(a) mixing an artificial chromosome comprising at least a first
recombination site and a vector comprising at least a second
recombination site and the heterologous nucleic acid;

(b) incubating the resulting mixture in the presence of at least one
recombination protein under conditions whereby recombination between
the first and second recombination sites is effected, thereby introducing
the heterologous nucleic acid into the artificial chromosome.

34. The method of claim 33, wherein the artificial chromosome is
an ACes.

35. The method of claim 33, wherein said mixing step (a) is
conducted in cells ex vivo.

36. The method of claim 33, wherein said mixing step (a) is
conducted extracellularly in an in vitro reaction mixture.

37. The method of claim 33, wherein the at least one
recombination protein is encoded by a bacteriophage selected from the
group consisting of bacteriophage lambda, phi 80, P22, P2, 186, P4 and
P1.

38. The method of claim 37, wherein the at least one
recombination protein is encoded by bacteriophage lambda, or mutants
thereof.

39. The method of claim 33, wherein at least one recombination
protein is selected from the group consisting of Int, IHF, Xis and Cre, γδ,
Tn3 resolvase, Hin, Gin, Cin and Flp.

40. The method of claim 32, wherein the recombination sites are
selected from the group consisting of att and lox P sites.

41. The method of claim 33, wherein the first and/or second
recombination site contains at least one mutation that removes one or
more stop codons.

42. The method of claim 33, wherein the first and/or second
recombination site contains at least one mutation that avoids hairpin
formation.

43. The method of claim 33, wherein the first and/or second
recombination site comprises at least a first nucleic acid sequence
selected from the group consisting of SEQ ID NOs:41-56:

a) RKYCWGCTTTYKTRTACNAASTSGB (m-att) (SEQ ID NO:41);
b) AGCCWGCTTTYKTRTACNAACTSGB (m-attB) (SEQ ID NO:42);
c) GTTCAGCTTTCKTRTACNAACTSGB (m-attR) (SEQ ID NO:43);
d) AGCCWGCTTTCKTRTACNAAGTSGB (m-attL) (SEQ ID NO:44);
e) GTTCAGCTTTYKTRTACNAAGTSGB (m-attP1) (SEQ ID NO:45);
f) AGCCTGCTTTTTTGTACAAACTTGT (attB1) (SEQ ID NO:46);
g) AGCCTGCTTTCTTGTACAAACTTGT (attB2) (SEQ ID NO:47);
h) ACCCAGCTTTCTTGTACAAACTTGT (attB3) (SEQ ID NO:48);
i) GTTCAGCTTTTTTGTACAAACTTGT (attR1) (SEQ ID NO:49);
j) GTTCAGCTTTCTTGTACAAACTTGT (attR2) (SEQ ID NO:50);
k) GTTCAGCTTTCTTGTACAAAGTTGG (attR3) (SEQ ID NO:51);
l) AGCCTGCTTTTTTGTACAAAGTTGG (attL1) (SEQ ID NO:52);
m) AGCCTGCTTTCTTGTACAAAGTTGG (attL2) (SEQ ID NO:53);
n) ACCCAGCTTTCTTGTACAAAGTTGG (attL3) (SEQ ID NO:54);
o) GTTCAGCTTTTTTGTACAAAGTTGG (attP1) (SEQ ID NO:55);
p) GTTCAGCTTTCTTGTACAAAGTTGG (attP2, P3) (SEQ ID NO:56);

and a corresponding or complementary DNA or RNA sequence,
wherein R = A or G, K = G or T/U, Y = C or T/U, W = A or T/U, N = A or C
or G or T/U, S = C or G, and B = C or G or T/U; and

the core region does not contain a stop codon in one or more
reading frames.

44. The method of claim 33, wherein the first and/or second 
recombination site comprises at least a first nucleic acid sequence 
selected from the group consisting of a mutated att recombination site 
containing at least one mutation that enhances recombinational 
specificity, a complementary DNA sequence thereto, and an RNA 
sequence corresponding thereto. 

45. The method of claim 33, wherein the vector comprising the 
second site further encodes at least one selectable marker. 

46. The method of claim 45, wherein the marker is a 
promoterless marker, which, upon recombination is under the control of a 
promoter and is thereby expressed. 

47. The method of claim 46, wherein the first recombination site 
is attP and is in the sense orientation prior to recombination. 

48. The method of claim 46, wherein the selectable marker is
selected from the group consisting of an antibiotic resistance gene, and a
detectable protein, wherein the detectable protein is chromogenic,
fluorescent, or capable of being bound by an antibody and FACS sorted.

49. The method of claim 48, wherein the selectable marker is 
selected from the group consisting of green fluorescent protein (GFP), red 
fluorescent protein (RFP), blue fluorescent protein (BFP), and E. coli 
histidinol dehydrogenase (hisD). 

50. A cell comprising the chromosome of claim 1.

51. The cell of claim 50, wherein the cell is a nuclear donor cell.

52. The cell of claim 50, wherein the cell is a stem cell.

53. The stem cell of claim 52, wherein said stem cell is human
and is selected from the group consisting of a mesenchymal stem cell, a
hematopoietic stem cell, an adult stem cell and an embryonic stem cell.

54. The cell of claim 50, wherein the cell is mammalian.

55. The cell of claim 54, wherein the mammal is selected from
the group consisting of humans, primates, cattle, pigs, rabbits, goats,
sheep, mice, rats, guinea pigs, hamsters, cats, dogs, and horses.

56. The cell of claim 50, wherein the cell is a plant cell.

57. A cell comprising the platform ACes of claim 14.

58. The cell of claim 57, wherein the cell is a nuclear donor cell.

59. The cell of claim 57, wherein the cell is a stem cell.

60. The stem cell of claim 59, wherein said stem cell is human
and is selected from the group consisting of a mesenchymal stem cell, a
hematopoietic stem cell, an adult stem cell and an embryonic stem cell.

61. A human mesenchymal cell comprising an artificial
chromosome.

62. The human mesenchymal cell of claim 61, wherein said
artificial chromosome is an ACes.

63. The human mesenchymal cell of claim 62, wherein the ACes
is a platform-ACes.

64. A method for introducing heterologous nucleic acid into the
mesenchymal cell of claim 63, comprising:

(a) introducing into the cell of claim 63, wherein the platform-ACes
has a first recombination site, a vector comprising at least a second
recombination site and the heterologous nucleic acid;

(b) incubating the resulting mixture in the presence of at least one
recombination protein under conditions whereby recombination between
the first and second recombination sites is effected, thereby introducing
the heterologous nucleic acid into the platform-ACes within the
mesenchymal cell.

65. A lambda-intR mutein comprising a glutamic acid to arginine
change at position 174 of wild-type lambda-intR.

66. The lambda-intR mutein of claim 65, wherein the lambda-intR
mutein comprises SEQ ID NO:37.

67. The method of claim 46, wherein the promoterless marker is
transcriptionally downstream of the heterologous nucleic acid, wherein
the heterologous nucleic acid encodes a heterologous protein, and
wherein the expression level of the selectable marker is transcriptionally
linked to the expression level of the heterologous protein.

68. The method of claim 67, wherein the selectable marker and
the heterologous nucleic acid are transcriptionally linked by the presence
of an IRES between them.

69. The method of claim 68, wherein the selectable marker is
selected from the group consisting of an antibiotic resistance gene, and a
detectable protein, wherein the detectable protein is chromogenic or
fluorescent.

70. The method of claim 69, wherein the selectable marker is
selected from the group consisting of green fluorescent protein (GFP), red
fluorescent protein (RFP), blue fluorescent protein (BFP), and E. coli
histidinol dehydrogenase.

71. The method of claim 67 further comprising expressing the
heterologous protein and isolating the heterologous protein.

72. A method for producing a transgenic animal, comprising
introducing a platform-ACes into an embryonic cell.

73. The method of claim 72, wherein the embryonic cell is a
stem cell.

74. The method of claim 72, wherein the embryonic cell is in an
embryo.

75. The method of claim 72, wherein the platform-ACes
comprises heterologous nucleic acid that encodes a therapeutic product.

76. The method of claim 72, wherein the transgenic animal is a
fish, insect, reptile, amphibian, arachnid or mammal.

77. The method of claim 72, wherein the ACes is introduced by
cell fusion, lipid-mediated transfection by a carrier system, microinjection,
microcell fusion, electroporation, microprojectile bombardment or direct
DNA transfer.

78. A transgenic animal produced by the method of claim 72.

79. A cell line useful for making a library of ACes, comprising a
multiplicity of heterologous recombination sites randomly integrated
throughout the endogenous chromosomes.

80. A method of making a library of ACes comprising random
portions of a genome, comprising introducing one or more ACes into the
cell line of claim 79, under conditions that promote the site-specific
chromosomal arm exchange of the ACes into, and out of, a multiplicity of
the heterologous recombination sites within the cell's chromosomal DNA;
and isolating said multiplicity of ACes, thereby producing a library of
ACes whereby multiple ACes have different portions of the genome
within.

81. A library of cells useful for genomic screening, said library
comprising a multiplicity of cells, wherein each cell comprises an ACes
having a mutually exclusive portion of a chromosomal nucleic acid
therein.

82. The library of cells of claim 81, wherein the cells of the
library are from a different species than the chromosomal nucleic acid
within the ACes.

83. A method of making one or more cell lines, comprising

a) integrating into endogenous chromosomal DNA of a selected cell
species, a multiplicity of heterologous recombination sites;

b) introducing a multiplicity of ACes under conditions that promote
the site-specific chromosomal arm exchange of the ACes into, and out of,
a multiplicity of the heterologous recombination sites integrated within the
cell's endogenous chromosomal DNA;

c) isolating said multiplicity of ACes, thereby producing a library of
ACes whereby a multiplicity of ACes have mutually exclusive portions of
the endogenous chromosomal DNA therein;

d) introducing the isolated multiplicity of ACes of step c) into a
multiplicity of cells, thereby creating a library of cells;

e) selecting different cells having mutually exclusive ACes therein
and clonally expanding or differentiating said different cells into clonal cell
cultures, thereby creating one or more cell lines.

84. The method of claim 23, wherein the nucleic acid molecule
with a recombination site is a PCR product.

85. The method of claim 23, wherein the recombinase is a protein and
the recombination event occurs in vitro.

86. The method of claim 33, wherein the vector is a PCR
product comprising a second recombination site.

87. The lambda-intR mutein of claim 65, wherein the mutein 
further comprises an amino acid signal for nuclear localization. 

88. The lambda-intR mutein of claim 65, wherein the mutein 
further comprises an epitope tag for protein purification. 

89. A modified iron-induced promoter comprising SEQ ID 
NO:128. 

90. A plasmid or expression cassette comprising the promoter of 
claim 89. 

91. A vector, comprising:

a recognition site for recombination; and

a sequence of nucleotides that targets the vector to an
amplifiable region of a chromosome.

92. The vector of claim 91, wherein the amplifiable region
comprises heterochromatic nucleic acid.

93. The vector of claim 91, wherein the amplifiable region
comprises rDNA.

94. The vector of claim 93, wherein the rDNA comprises an
intergenic spacer.

95. The vector of claim 91, further comprising nucleic acid
encoding a selectable marker that is not operably associated with any
promoter.

96. The vector of claim 91, wherein the chromosome is a
mammalian chromosome.

97. The vector of claim 91, wherein the chromosome is a plant
chromosome.

98. A cell of claim 57 that is a plant cell, wherein the ACes 
platform is a MAC. 

99. The plant cell of claim 98, wherein the MAC comprises 
transcriptional regulatory sequence of nucleotides derived from plants. 

100. The plant cell of claim 99, wherein the regulatory sequence
is selected from the group consisting of promoters, terminators,
enhancers, silencers and transcription factor binding sites.

101. A cell of claim 57 that is an animal cell, wherein the ACes
platform is a plant artificial chromosome (PAC).

102. The cell of claim 101 that is a mammalian cell.

103. The cell of claim 98, wherein the MAC comprises
transcriptional regulatory sequence of nucleotides derived from plants.

104. The cell of claim 102, wherein the MAC comprises
transcriptional regulatory sequence of nucleotides derived from plants.

105. The cell of claim 104, wherein the regulatory sequence is
selected from the group consisting of promoters, terminators, enhancers,
silencers and transcription factor binding sites.

106. A method, comprising:

introducing a vector of claim 91 into a cell;

growing the cells; and

selecting a cell comprising an artificial chromosome that comprises
one or more repeat regions.

107. The method of claim 106, wherein a sufficient portion of the
vector integrates into a chromosome in the cell to result in amplification
of chromosomal DNA.

108. The method of claim 106, wherein the artificial chromosome
is an ACes.

109. A method for screening, comprising:

contacting a cell comprising a reporter ACes with test compounds
or known compounds, wherein:

the reporter ACes comprises one or a plurality of reporter
constructs;

a reporter construct comprises a reporter gene in operative linkage
with a regulatory region responsive to test or known compounds; and

detecting any increase or decrease in signal output from the
reporter, wherein a change in the signal is indicative of activity of the test
or known compound on the regulatory region.

110. The method of claim 109, wherein the reporter is operatively
linked to a promoter that controls expression of a gene in a signal
transduction pathway, whereby activation or reduction in the signal
indicates that the pathway is activated or down-regulated by the test
compound.

111. The method of claim 109, wherein the reporter in the
construct encodes drug resistance or encodes a fluorescent protein.

112. The method of claim 111, wherein the fluorescent protein is
selected from the group consisting of red, green and blue fluorescent
proteins.

113. The method of claim 109, wherein the ACes comprises a 
plurality of reporter-linked constructs, each with a different reporter, 
whereby the pathway(s) affected by the test compounds can be 
elucidated. 

114. The method of claim 109, wherein a reporter is operatively
linked to a promoter that is transcriptionally regulated in response to DNA
damage, and the test compounds are genotoxicants.

115. The method of claim 114, wherein the DNA damage is
induced by apoptosis, necrosis or cell-cycle perturbations.

116. The method of claim 114, wherein unknown compounds are
screened to assess whether they are genotoxicants.

117. The method of claim 114, wherein the promoter is a
cytochrome P450-profiled promoter.

118. The method of claim 114, wherein the cell is in a transgenic
animal and toxicity is assessed in the animal.

119. The method of claim 109, wherein:

the cell is a patient cell sample; the patient has a disease;

the regulatory region is one targeted by a drug or drug regimen;
and

the method assesses the effectiveness of a treatment for the
disease for the particular patient.

120. The method of claim 119, wherein the cell is a tumor cell.

121. The method of claim 109, wherein the cell is a stem cell or a
progenitor cell, whereby expression of the reporter is operatively linked to
a regulatory region expressed in the cells to thereby identify stem cells or
progenitor cells.

122. The method of claim 109, wherein the cell is in an animal;
and the method comprises whole-body imaging to monitor expression of
the reporter in the animal.

123. A reporter ACes comprising one or a plurality of reporter
constructs, wherein the reporter construct comprises a reporter gene in
operative linkage with a regulatory region responsive to test or known
compounds.